Overview
Dataset statistics
| Number of variables | 98 |
|---|---|
| Number of observations | 584201 |
| Missing cells | 14445452 |
| Missing cells (%) | 25.2% |
| Total size in memory | 436.8 MiB |
| Average record size in memory | 784.0 B |
Variable types
| Text | 98 |
|---|
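The percentage and per-record figures in the Dataset statistics table follow from the raw counts. A minimal Python sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 MiB is 2^20 bytes. The variable names are illustrative and not part of the report.

```python
# Figures copied from the Dataset statistics table.
n_variables = 98
n_observations = 584_201
missing_cells = 14_445_452
total_size_mib = 436.8

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 2**20 bytes.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"{missing_pct:.1f}%")        # 25.2%
print(f"{avg_record_bytes:.1f} B")  # 784.0 B
```

Both results agree with the table, which suggests the reported values are internally consistent.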
Dataset
| Description | Herpetology NMNH Extant Specimen Records 0054921-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.rf2che |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "HERP" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
kingdom has constant value "Animalia" | Constant |
phylum has constant value "Chordata" | Constant |
datasetKey has constant value "821cc27a-e3bb-4bc5-ac34-89ada245069d" | Constant |
publishingCountry has constant value "US" | Constant |
kingdomKey has constant value "1" | Constant |
phylumKey has constant value "44" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2024-12-02T11:48:23.416Z" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
recordNumber has 583925 (> 99.9%) missing values | Missing |
sex has 531942 (91.1%) missing values | Missing |
lifeStage has 542754 (92.9%) missing values | Missing |
associatedSequences has 583480 (99.9%) missing values | Missing |
occurrenceRemarks has 557618 (95.4%) missing values | Missing |
fieldNumber has 584193 (> 99.9%) missing values | Missing |
eventDate has 39140 (6.7%) missing values | Missing |
startDayOfYear has 86170 (14.8%) missing values | Missing |
endDayOfYear has 86170 (14.8%) missing values | Missing |
year has 39600 (6.8%) missing values | Missing |
month has 59025 (10.1%) missing values | Missing |
day has 100844 (17.3%) missing values | Missing |
continent has 10069 (1.7%) missing values | Missing |
waterBody has 555994 (95.2%) missing values | Missing |
islandGroup has 564324 (96.6%) missing values | Missing |
island has 576136 (98.6%) missing values | Missing |
countryCode has 10837 (1.9%) missing values | Missing |
stateProvince has 17001 (2.9%) missing values | Missing |
county has 191557 (32.8%) missing values | Missing |
verbatimElevation has 331608 (56.8%) missing values | Missing |
decimalLatitude has 162667 (27.8%) missing values | Missing |
decimalLongitude has 162667 (27.8%) missing values | Missing |
coordinateUncertaintyInMeters has 439218 (75.2%) missing values | Missing |
georeferenceProtocol has 439136 (75.2%) missing values | Missing |
georeferenceRemarks has 443625 (75.9%) missing values | Missing |
identificationQualifier has 583784 (99.9%) missing values | Missing |
typeStatus has 571070 (97.8%) missing values | Missing |
identifiedBy has 584125 (> 99.9%) missing values | Missing |
order has 189040 (32.4%) missing values | Missing |
specificEpithet has 15011 (2.6%) missing values | Missing |
infraspecificEpithet has 559230 (95.7%) missing values | Missing |
elevation has 332110 (56.8%) missing values | Missing |
elevationAccuracy has 333288 (57.1%) missing values | Missing |
distanceFromCentroidInMeters has 581727 (99.6%) missing values | Missing |
mediaType has 579082 (99.1%) missing values | Missing |
orderKey has 189040 (32.4%) missing values | Missing |
speciesKey has 15011 (2.6%) missing values | Missing |
species has 15011 (2.6%) missing values | Missing |
repatriated has 10596 (1.8%) missing values | Missing |
gbifRegion has 11409 (2.0%) missing values | Missing |
level0Gid has 173676 (29.7%) missing values | Missing |
level0Name has 173676 (29.7%) missing values | Missing |
level1Gid has 174349 (29.8%) missing values | Missing |
level1Name has 174349 (29.8%) missing values | Missing |
level2Gid has 186113 (31.9%) missing values | Missing |
level2Name has 186171 (31.9%) missing values | Missing |
level3Gid has 532468 (91.1%) missing values | Missing |
level3Name has 532843 (91.2%) missing values | Missing |
iucnRedListCategory has 23468 (4.0%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
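The Constant, Missing, and Unique labels above can be illustrated with a toy classifier. This is a sketch under stated assumptions, not the profiler's own logic: the function name `classify_column`, the `missing_threshold` parameter, and the exact rules are hypothetical, and `None` stands in for a missing cell.

```python
def classify_column(values, missing_threshold=0.0):
    """Return alert labels for one column; None marks a missing cell.

    Hypothetical rules for illustration only.
    """
    n = len(values)
    present = [v for v in values if v is not None]
    n_missing = n - len(present)
    distinct = set(present)

    alerts = []
    # One distinct value across the column.
    if len(distinct) == 1:
        alerts.append("Constant")
    # Share of missing cells above the (assumed) threshold.
    if n and n_missing / n > missing_threshold:
        alerts.append("Missing")
    # No gaps and no repeated value.
    if present and n_missing == 0 and len(distinct) == n:
        alerts.append("Unique")
    return alerts


print(classify_column(["USNM", "USNM", "USNM"]))        # ['Constant']
print(classify_column(["F", None, None, "M"]))          # ['Missing']
print(classify_column(["1317203362", "1317203927"]))    # ['Unique']
```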
Reproduction
| Analysis started | 2025-01-08 22:55:34.710458 |
|---|---|
| Analysis finished | 2025-01-08 22:55:58.312133 |
| Duration | 23.6 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
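As a quick consistency check, the reported duration can be recomputed from the two timestamps. The sketch below uses only the Python standard library and assumes both timestamps are in the same time zone.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction table.
started = datetime.strptime("2025-01-08 22:55:34.710458", FMT)
finished = datetime.strptime("2025-01-08 22:55:58.312133", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.1f} seconds")  # 23.6 seconds
```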
Variables
gbifID
Text
Unique 
| Distinct | 584201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 584201 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1317203362 |
|---|---|
| 2nd row | 1317203927 |
| 3rd row | 1317204107 |
| 4th row | 1322537851 |
| 5th row | 1322539748 |
| Value | Count | Frequency (%) |
| 1317203362 | 1 | < 0.1% |
| 1322539748 | 1 | < 0.1% |
| 1322560470 | 1 | < 0.1% |
| 1322558547 | 1 | < 0.1% |
| 1317274722 | 1 | < 0.1% |
| 1317214758 | 1 | < 0.1% |
| 1317204107 | 1 | < 0.1% |
| 1322537851 | 1 | < 0.1% |
| 1317211425 | 1 | < 0.1% |
| 1322569185 | 1 | < 0.1% |
| Other values (584191) | 584191 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5842010 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5842010 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5842010 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1168402 | |
| 0 | 1168402 | |
| _ | 1168402 | |
| 1 | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1752603 | |
| Uppercase Letter | 1168402 | |
| Connector Punctuation | 1168402 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1168402 | |
| 1 | 584201 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1168402 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2921005 | |
| Latin | 1168402 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1168402 | |
| _ | 1168402 | |
| 1 | 584201 |
Latin
| Value | Count | Frequency (%) |
| C | 1168402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4089407 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1168402 | |
| 0 | 1168402 | |
| _ | 1168402 | |
| 1 | 584201 |
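The category totals for a constant column can be reproduced from a single value. The sketch below assumes the category labels in this report correspond to Unicode general categories (`Nd` for Decimal Number, `Lu` for Uppercase Letter, `Pc` for Connector Punctuation) and uses Python's standard `unicodedata` module; it is an illustration, not the profiler's code.

```python
import unicodedata
from collections import Counter

value = "CC0_1_0"   # the constant value of the license column
n_rows = 584_201    # number of observations in the dataset

# Count Unicode general categories within one value.
per_value = Counter(unicodedata.category(ch) for ch in value)

# Every row holds the same string, so totals scale by the row count.
totals = {cat: count * n_rows for cat, count in per_value.items()}
print(totals)
```

Under these assumptions the totals are 1752603 for `Nd`, 1168402 for `Lu`, and 1168402 for `Pc`, matching the "Most occurring categories" table for this column.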
modified
Text
| Distinct | 11116 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 6239 |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 2022-03-25T16:29:00Z |
|---|---|
| 2nd row | 2022-12-14T12:20:00Z |
| 3rd row | 2022-07-25T13:54:00Z |
| 4th row | 2022-03-25T16:12:00Z |
| 5th row | 2022-03-25T16:41:00Z |
| Value | Count | Frequency (%) |
| 2022-08-17t10:53:00z | 3308 | 0.6% |
| 2022-08-17t10:58:00z | 3292 | 0.6% |
| 2022-08-17t10:59:00z | 3292 | 0.6% |
| 2022-08-17t10:54:00z | 3283 | 0.6% |
| 2022-08-17t10:57:00z | 3269 | 0.6% |
| 2022-08-17t10:56:00z | 3263 | 0.6% |
| 2022-08-17t11:00:00z | 3247 | 0.6% |
| 2022-08-17t11:01:00z | 3245 | 0.6% |
| 2022-08-17t11:03:00z | 3243 | 0.6% |
| 2022-08-17t11:15:00z | 3237 | 0.6% |
| Other values (11106) | 551522 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| - | 1168402 | |
| : | 1168402 | |
| T | 584201 | 5.0% |
| Z | 584201 | 5.0% |
| 8 | 454189 | 3.9% |
| 5 | 397958 | 3.4% |
| 3 | 369937 | 3.2% |
| Other values (4) | 823208 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8178814 | |
| Dash Punctuation | 1168402 | 10.0% |
| Other Punctuation | 1168402 | 10.0% |
| Uppercase Letter | 1168402 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| 8 | 454189 | 5.6% |
| 5 | 397958 | 4.9% |
| 3 | 369937 | 4.5% |
| 7 | 249238 | 3.0% |
| 4 | 229171 | 2.8% |
| 6 | 174007 | 2.1% |
| 9 | 170792 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1168402 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10515618 | |
| Latin | 1168402 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| - | 1168402 | |
| : | 1168402 | |
| 8 | 454189 | 4.3% |
| 5 | 397958 | 3.8% |
| 3 | 369937 | 3.5% |
| 7 | 249238 | 2.4% |
| 4 | 229171 | 2.2% |
| Other values (2) | 344799 | 3.3% |
Latin
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11684020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| - | 1168402 | |
| : | 1168402 | |
| T | 584201 | 5.0% |
| Z | 584201 | 5.0% |
| 8 | 454189 | 3.9% |
| 5 | 397958 | 3.4% |
| 3 | 369937 | 3.2% |
| Other values (4) | 823208 | 7.0% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 584201 | |
| museum | 584201 | |
| of | 584201 | |
| natural | 584201 | |
| history | 584201 | |
| smithsonian | 584201 | |
| institution | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4089407 | |
| i | 3505206 | |
| (space) | 3505206 | |
| a | 2921005 | 8.5% |
| o | 2921005 | 8.5% |
| n | 2921005 | 8.5% |
| s | 2336804 | 6.8% |
| u | 2336804 | 6.8% |
| r | 1168402 | 3.4% |
| m | 1168402 | 3.4% |
| Other values (11) | 7594613 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26873246 | |
| Space Separator | 3505206 | 10.2% |
| Uppercase Letter | 3505206 | 10.2% |
| Other Punctuation | 584201 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4089407 | |
| i | 3505206 | |
| a | 2921005 | |
| o | 2921005 | |
| n | 2921005 | |
| s | 2336804 | |
| u | 2336804 | |
| r | 1168402 | 4.3% |
| m | 1168402 | 4.3% |
| l | 1168402 | 4.3% |
| Other values (4) | 2336804 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1168402 | |
| M | 584201 | |
| H | 584201 | |
| S | 584201 | |
| I | 584201 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3505206 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30378452 | |
| Common | 4089407 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4089407 | |
| i | 3505206 | |
| a | 2921005 | |
| o | 2921005 | |
| n | 2921005 | |
| s | 2336804 | 7.7% |
| u | 2336804 | 7.7% |
| r | 1168402 | 3.8% |
| m | 1168402 | 3.8% |
| N | 1168402 | 3.8% |
| Other values (9) | 5842010 |
Common
| Value | Count | Frequency (%) |
| (space) | 3505206 | |
| , | 584201 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34467859 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4089407 | |
| i | 3505206 | |
| (space) | 3505206 | |
| a | 2921005 | 8.5% |
| o | 2921005 | 8.5% |
| n | 2921005 | 8.5% |
| s | 2336804 | 6.8% |
| u | 2336804 | 6.8% |
| r | 1168402 | 3.4% |
| m | 1168402 | 3.4% |
| Other values (11) | 7594613 |
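The Value table for this column lists lowercase words rather than the full string, and the comma after "History" is gone. One tokenization that would produce that output is sketched below. It is an assumption inferred from the tables in this report, not the profiler's documented behavior, and the helper name `tokenize` is hypothetical.

```python
import string
from collections import Counter


def tokenize(value):
    """Lowercase, split on whitespace, trim punctuation from token edges."""
    tokens = (tok.strip(string.punctuation) for tok in value.lower().split())
    return [tok for tok in tokens if tok]


publisher = "National Museum of Natural History, Smithsonian Institution"
counts = Counter(tokenize(publisher))
print(sorted(counts))
```

The same rule leaves single-token values such as `CC0_1_0` or `2022-03-25T16:29:00Z` intact apart from lowercasing, which is consistent with the Value tables of the license and modified columns.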
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2336804 | |
| : | 2336804 | |
| l | 1752603 | 10.3% |
| i | 1168402 | 6.9% |
| r | 1168402 | 6.9% |
| c | 1168402 | 6.9% |
| g | 584201 | 3.4% |
| 7 | 584201 | 3.4% |
| 8 | 584201 | 3.4% |
| 4 | 584201 | 3.4% |
| Other values (8) | 4673608 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11099819 | |
| Other Punctuation | 2921005 | 17.2% |
| Decimal Number | 2921005 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2336804 | |
| l | 1752603 | |
| i | 1168402 | |
| r | 1168402 | |
| c | 1168402 | |
| g | 584201 | 5.3% |
| u | 584201 | 5.3% |
| b | 584201 | 5.3% |
| d | 584201 | 5.3% |
| s | 584201 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 584201 | |
| 8 | 584201 | |
| 4 | 584201 | |
| 3 | 584201 | |
| 1 | 584201 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2336804 | |
| . | 584201 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11099819 | |
| Common | 5842010 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2336804 | |
| l | 1752603 | |
| i | 1168402 | |
| r | 1168402 | |
| c | 1168402 | |
| g | 584201 | 5.3% |
| u | 584201 | 5.3% |
| b | 584201 | 5.3% |
| d | 584201 | 5.3% |
| s | 584201 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 2336804 | |
| 7 | 584201 | 10.0% |
| 8 | 584201 | 10.0% |
| 4 | 584201 | 10.0% |
| 3 | 584201 | 10.0% |
| . | 584201 | 10.0% |
| 1 | 584201 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16941829 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2336804 | |
| : | 2336804 | |
| l | 1752603 | 10.3% |
| i | 1168402 | 6.9% |
| r | 1168402 | 6.9% |
| c | 1168402 | 6.9% |
| g | 584201 | 3.4% |
| 7 | 584201 | 3.4% |
| 8 | 584201 | 3.4% |
| 4 | 584201 | 3.4% |
| Other values (8) | 4673608 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
|---|---|
| 2nd row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 3rd row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 4th row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 5th row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| Value | Count | Frequency (%) |
| urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2921005 | 11.1% |
| - | 2336804 | 8.9% |
| u | 1752603 | 6.7% |
| c | 1752603 | 6.7% |
| 7 | 1752603 | 6.7% |
| 0 | 1752603 | 6.7% |
| b | 1752603 | 6.7% |
| d | 1752603 | 6.7% |
| 4 | 1168402 | 4.4% |
| f | 1168402 | 4.4% |
| Other values (10) | 8178814 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11684020 | |
| Decimal Number | 11099819 | |
| Dash Punctuation | 2336804 | 8.9% |
| Other Punctuation | 1168402 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1752603 | |
| c | 1752603 | |
| b | 1752603 | |
| d | 1752603 | |
| f | 1168402 | |
| a | 1168402 | |
| i | 584201 | 5.0% |
| r | 584201 | 5.0% |
| e | 584201 | 5.0% |
| n | 584201 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2921005 | |
| 7 | 1752603 | |
| 0 | 1752603 | |
| 4 | 1168402 | 10.5% |
| 8 | 1168402 | 10.5% |
| 3 | 1168402 | 10.5% |
| 9 | 584201 | 5.3% |
| 6 | 584201 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2336804 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14605025 | |
| Latin | 11684020 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2921005 | |
| - | 2336804 | |
| 7 | 1752603 | |
| 0 | 1752603 | |
| 4 | 1168402 | 8.0% |
| : | 1168402 | 8.0% |
| 8 | 1168402 | 8.0% |
| 3 | 1168402 | 8.0% |
| 9 | 584201 | 4.0% |
| 6 | 584201 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| u | 1752603 | |
| c | 1752603 | |
| b | 1752603 | |
| d | 1752603 | |
| f | 1168402 | |
| a | 1168402 | |
| i | 584201 | 5.0% |
| r | 584201 | 5.0% |
| e | 584201 | 5.0% |
| n | 584201 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26289045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2921005 | 11.1% |
| - | 2336804 | 8.9% |
| u | 1752603 | 6.7% |
| c | 1752603 | 6.7% |
| 7 | 1752603 | 6.7% |
| 0 | 1752603 | 6.7% |
| b | 1752603 | 6.7% |
| d | 1752603 | 6.7% |
| 4 | 1168402 | 4.4% |
| f | 1168402 | 4.4% |
| Other values (10) | 8178814 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2336804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2336804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2336804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HERP |
|---|---|
| 2nd row | HERP |
| 3rd row | HERP |
| 4th row | HERP |
| 5th row | HERP |
| Value | Count | Frequency (%) |
| herp | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2336804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2336804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2336804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 584201 | |
| extant | 584201 | |
| biology | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1168402 | 10.5% |
| (space) | 1168402 | 10.5% |
| t | 1168402 | 10.5% |
| o | 1168402 | 10.5% |
| M | 584201 | 5.3% |
| H | 584201 | 5.3% |
| E | 584201 | 5.3% |
| x | 584201 | 5.3% |
| a | 584201 | 5.3% |
| n | 584201 | 5.3% |
| Other values (5) | 2921005 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6426211 | |
| Uppercase Letter | 3505206 | |
| Space Separator | 1168402 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1168402 | |
| o | 1168402 | |
| x | 584201 | |
| a | 584201 | |
| n | 584201 | |
| i | 584201 | |
| l | 584201 | |
| g | 584201 | |
| y | 584201 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1168402 | |
| M | 584201 | |
| H | 584201 | |
| E | 584201 | |
| B | 584201 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9931417 | |
| Common | 1168402 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1168402 | |
| t | 1168402 | |
| o | 1168402 | |
| M | 584201 | 5.9% |
| H | 584201 | 5.9% |
| E | 584201 | 5.9% |
| x | 584201 | 5.9% |
| a | 584201 | 5.9% |
| n | 584201 | 5.9% |
| B | 584201 | 5.9% |
| Other values (4) | 2336804 |
Common
| Value | Count | Frequency (%) |
| (space) | 1168402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11099819 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1168402 | 10.5% |
| (space) | 1168402 | 10.5% |
| t | 1168402 | 10.5% |
| o | 1168402 | 10.5% |
| M | 584201 | 5.3% |
| H | 584201 | 5.3% |
| E | 584201 | 5.3% |
| x | 584201 | 5.3% |
| a | 584201 | 5.3% |
| n | 584201 | 5.3% |
| Other values (5) | 2921005 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 18.00021739 |
| Min length | 18 |
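The fractional mean length is explained by the two distinct values of this column: almost every row is an 18-character string and a small number are 19 characters long. A short sketch of the weighted mean, using the value counts reported for this variable (the uppercase spellings are taken from the Sample rows):

```python
# Value counts for basisOfRecord as reported in this section.
counts = {"PRESERVED_SPECIMEN": 584_074, "MACHINE_OBSERVATION": 127}

total = sum(counts.values())
mean_length = sum(len(value) * n for value, n in counts.items()) / total

print(f"{mean_length:.8f}")  # 18.00021739
```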
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 584074 | |
| machine_observation | 127 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2920624 | |
| R | 1168275 | |
| S | 1168275 | |
| P | 1168148 | 11.1% |
| I | 584328 | 5.6% |
| N | 584328 | 5.6% |
| V | 584201 | 5.6% |
| _ | 584201 | 5.6% |
| C | 584201 | 5.6% |
| M | 584201 | 5.6% |
| Other values (6) | 584963 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9931544 | |
| Connector Punctuation | 584201 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2920624 | |
| R | 1168275 | |
| S | 1168275 | |
| P | 1168148 | 11.8% |
| I | 584328 | 5.9% |
| N | 584328 | 5.9% |
| V | 584201 | 5.9% |
| C | 584201 | 5.9% |
| M | 584201 | 5.9% |
| D | 584074 | 5.9% |
| Other values (5) | 889 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9931544 | |
| Common | 584201 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2920624 | |
| R | 1168275 | |
| S | 1168275 | |
| P | 1168148 | 11.8% |
| I | 584328 | 5.9% |
| N | 584328 | 5.9% |
| V | 584201 | 5.9% |
| C | 584201 | 5.9% |
| M | 584201 | 5.9% |
| D | 584074 | 5.9% |
| Other values (5) | 889 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10515745 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2920624 | |
| R | 1168275 | |
| S | 1168275 | |
| P | 1168148 | 11.1% |
| I | 584328 | 5.6% |
| N | 584328 | 5.6% |
| V | 584201 | 5.6% |
| _ | 584201 | 5.6% |
| C | 584201 | 5.6% |
| M | 584201 | 5.6% |
| Other values (6) | 584963 | 5.6% |
occurrenceID
Text
Unique 
| Distinct | 584201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 584201 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3000ac9b1-ec0b-4be2-939f-464ad355cc84 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/30010adfb-58e1-4e98-8d39-ee055b3463fa |
| 3rd row | http://n2t.net/ark:/65665/30012ab17-d2a1-470c-a774-540bc6cffb00 |
| 4th row | http://n2t.net/ark:/65665/3ec02d332-deb7-4b55-ba3d-5a5d6ca577c9 |
| 5th row | http://n2t.net/ark:/65665/3ec19a125-2484-4fa3-b6b7-7d87199a6994 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3000ac9b1-ec0b-4be2-939f-464ad355cc84 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec19a125-2484-4fa3-b6b7-7d87199a6994 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ed02751f-656c-458c-80fa-90bf891a2063 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3eced04ac-39a4-455a-85e7-7cb0b4299f6b | 1 | < 0.1% |
| http://n2t.net/ark:/65665/303348f04-82b4-456c-be8d-764af3205229 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3008b1b21-05b1-4e8d-b34c-1e3a96daecf7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30012ab17-d2a1-470c-a774-540bc6cffb00 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec02d332-deb7-4b55-ba3d-5a5d6ca577c9 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3006575b6-ca0a-42bd-b75d-3241cc3e332d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ed66e63b-4fff-4639-8abf-a635d31dd047 | 1 | < 0.1% |
| Other values (584191) | 584191 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 2921005 | 7.9% |
| 6 | 2847614 | 7.7% |
| - | 2336804 | 6.3% |
| t | 2336804 | 6.3% |
| 5 | 2265995 | 6.2% |
| a | 1826256 | 5.0% |
| e | 1681096 | 4.6% |
| 2 | 1680524 | 4.6% |
| 3 | 1680017 | 4.6% |
| 4 | 1678083 | 4.6% |
| Other values (16) | 15550465 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15919515 | |
| Lowercase Letter | 13874736 | |
| Other Punctuation | 4673608 | 12.7% |
| Dash Punctuation | 2336804 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2336804 | |
| a | 1826256 | |
| e | 1681096 | |
| b | 1241364 | |
| n | 1168402 | |
| c | 1094913 | |
| f | 1094889 | |
| d | 1094208 | |
| k | 584201 | 4.2% |
| r | 584201 | 4.2% |
| Other values (2) | 1168402 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2847614 | |
| 5 | 2265995 | |
| 2 | 1680524 | |
| 3 | 1680017 | |
| 4 | 1678083 | |
| 9 | 1244007 | |
| 8 | 1240305 | |
| 1 | 1096638 | 6.9% |
| 7 | 1094431 | 6.9% |
| 0 | 1091901 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2921005 | |
| : | 1168402 | 25.0% |
| . | 584201 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2336804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22929927 | |
| Latin | 13874736 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 2921005 | |
| 6 | 2847614 | |
| - | 2336804 | |
| 5 | 2265995 | |
| 2 | 1680524 | |
| 3 | 1680017 | |
| 4 | 1678083 | |
| 9 | 1244007 | 5.4% |
| 8 | 1240305 | 5.4% |
| : | 1168402 | 5.1% |
| Other values (4) | 3867171 |
Latin
| Value | Count | Frequency (%) |
| t | 2336804 | |
| a | 1826256 | |
| e | 1681096 | |
| b | 1241364 | |
| n | 1168402 | |
| c | 1094913 | |
| f | 1094889 | |
| d | 1094208 | |
| k | 584201 | 4.2% |
| r | 584201 | 4.2% |
| Other values (2) | 1168402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36804663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 2921005 | 7.9% |
| 6 | 2847614 | 7.7% |
| - | 2336804 | 6.3% |
| t | 2336804 | 6.3% |
| 5 | 2265995 | 6.2% |
| a | 1826256 | 5.0% |
| e | 1681096 | 4.6% |
| 2 | 1680524 | 4.6% |
| 3 | 1680017 | 4.6% |
| 4 | 1678083 | 4.6% |
| Other values (16) | 15550465 |
catalogNumber
Text
Unique 
| Distinct | 584201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 11 |
| Mean length | 10.93256944 |
| Min length | 6 |
Unique
| Unique | 584201 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | USNM 231889 |
|---|---|
| 2nd row | USNM 487703 |
| 3rd row | USNM 297347 |
| 4th row | USNM 322261 |
| 5th row | USNM 319170 |
| Value | Count | Frequency (%) |
| usnm | 584201 | |
| herp | 5833 | 0.5% |
| tissue | 5706 | 0.5% |
| image | 127 | < 0.1% |
| 2847 | 3 | < 0.1% |
| 2877 | 3 | < 0.1% |
| 2872 | 3 | < 0.1% |
| 2940 | 3 | < 0.1% |
| 2715 | 3 | < 0.1% |
| 9 | 3 | < 0.1% |
| Other values (581072) | 584183 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 595867 | 9.3% |
| U | 584201 | 9.1% |
| N | 584201 | 9.1% |
| M | 584201 | 9.1% |
| S | 584201 | 9.1% |
| 4 | 393545 | 6.2% |
| 2 | 393142 | 6.2% |
| 3 | 392798 | 6.2% |
| 1 | 391284 | 6.1% |
| 5 | 383581 | 6.0% |
| Other values (17) | 1499797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3395944 | |
| Uppercase Letter | 2348470 | |
| Space Separator | 595867 | 9.3% |
| Lowercase Letter | 46537 | 0.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 393545 | |
| 2 | 393142 | |
| 3 | 392798 | |
| 1 | 391284 | |
| 5 | 383581 | |
| 6 | 292686 | |
| 7 | 291064 | |
| 8 | 290326 | |
| 9 | 285200 | |
| 0 | 282318 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11666 | |
| s | 11412 | |
| r | 5833 | |
| p | 5833 | |
| i | 5706 | |
| u | 5706 | |
| m | 127 | 0.3% |
| a | 127 | 0.3% |
| g | 127 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584201 | |
| N | 584201 | |
| M | 584201 | |
| S | 584201 | |
| H | 5833 | 0.2% |
| T | 5706 | 0.2% |
| I | 127 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 595867 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3991811 | |
| Latin | 2395007 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584201 | |
| N | 584201 | |
| M | 584201 | |
| S | 584201 | |
| e | 11666 | 0.5% |
| s | 11412 | 0.5% |
| H | 5833 | 0.2% |
| r | 5833 | 0.2% |
| p | 5833 | 0.2% |
| T | 5706 | 0.2% |
| Other values (6) | 11920 | 0.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 595867 | |
| 4 | 393545 | |
| 2 | 393142 | |
| 3 | 392798 | |
| 1 | 391284 | |
| 5 | 383581 | |
| 6 | 292686 | |
| 7 | 291064 | |
| 8 | 290326 | |
| 9 | 285200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6386818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 595867 | 9.3% |
| U | 584201 | 9.1% |
| N | 584201 | 9.1% |
| M | 584201 | 9.1% |
| S | 584201 | 9.1% |
| 4 | 393545 | 6.2% |
| 2 | 393142 | 6.2% |
| 3 | 392798 | 6.2% |
| 1 | 391284 | 6.1% |
| 5 | 383581 | 6.0% |
| Other values (17) | 1499797 |
recordNumber
Text
Missing 
| Distinct | 273 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 583925 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.460144928 |
| Min length | 1 |
Unique
| Unique | 271 |
|---|---|
| Unique (%) | 98.2% |
Sample
| 1st row | RWM 20004 |
|---|---|
| 2nd row | RWM 19953 |
| 3rd row | RWM 19978 |
| 4th row | RWM 19932 |
| 5th row | RWM 19955 |
| Value | Count | Frequency (%) |
| rwm | 182 | |
| gmu | 74 | 13.5% |
| lc | 15 | 2.7% |
| 8 | 3 | 0.5% |
| 19897 | 2 | 0.4% |
| 19895 | 1 | 0.2% |
| 19926 | 1 | 0.2% |
| 2430 | 1 | 0.2% |
| 19973 | 1 | 0.2% |
| 19925 | 1 | 0.2% |
| Other values (267) | 267 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 272 | |
| 9 | 260 | |
| M | 257 | |
| 0 | 245 | |
| 1 | 190 | |
| W | 182 | |
| R | 182 | |
| 2 | 165 | |
| 3 | 95 | 4.1% |
| G | 75 | 3.2% |
| Other values (9) | 412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1262 | |
| Uppercase Letter | 801 | |
| Space Separator | 272 | 11.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 260 | |
| 0 | 245 | |
| 1 | 190 | |
| 2 | 165 | |
| 3 | 95 | 7.5% |
| 7 | 71 | 5.6% |
| 6 | 63 | 5.0% |
| 4 | 62 | 4.9% |
| 8 | 57 | 4.5% |
| 5 | 54 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 257 | |
| W | 182 | |
| R | 182 | |
| G | 75 | 9.4% |
| U | 74 | 9.2% |
| C | 15 | 1.9% |
| L | 15 | 1.9% |
| D | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1534 | |
| Latin | 801 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 272 | |
| 9 | 260 | |
| 0 | 245 | |
| 1 | 190 | |
| 2 | 165 | |
| 3 | 95 | 6.2% |
| 7 | 71 | 4.6% |
| 6 | 63 | 4.1% |
| 4 | 62 | 4.0% |
| 8 | 57 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| M | 257 | |
| W | 182 | |
| R | 182 | |
| G | 75 | 9.4% |
| U | 74 | 9.2% |
| C | 15 | 1.9% |
| L | 15 | 1.9% |
| D | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2335 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 272 | |
| 9 | 260 | |
| M | 257 | |
| 0 | 245 | |
| 1 | 190 | |
| W | 182 | |
| R | 182 | |
| 2 | 165 | |
| 3 | 95 | 4.1% |
| G | 75 | 3.2% |
| Other values (9) | 412 |
individualCount
Text
| Distinct | 158 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.004863086 |
| Min length | 1 |
Unique
| Unique | 51 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 576101 | |
| 2 | 1312 | 0.2% |
| 0 | 1007 | 0.2% |
| 3 | 830 | 0.1% |
| 5 | 523 | 0.1% |
| 4 | 522 | 0.1% |
| 6 | 386 | 0.1% |
| 7 | 339 | 0.1% |
| 8 | 271 | < 0.1% |
| 10 | 257 | < 0.1% |
| Other values (148) | 2649 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 587038 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 587038 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 587038 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
sex
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 531942 |
| Missing (%) | 91.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 4.859507453 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | MALE |
| 3rd row | FEMALE |
| 4th row | MALE |
| 5th row | FEMALE |
| Value | Count | Frequency (%) |
| male | 29804 | |
| female | 22454 | |
| hermaphrodite | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 74714 | |
| M | 52259 | |
| A | 52259 | |
| L | 52258 | |
| F | 22454 | 8.8% |
| H | 2 | < 0.1% |
| R | 2 | < 0.1% |
| P | 1 | < 0.1% |
| O | 1 | < 0.1% |
| D | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 253953 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 74714 | |
| M | 52259 | |
| A | 52259 | |
| L | 52258 | |
| F | 22454 | 8.8% |
| H | 2 | < 0.1% |
| R | 2 | < 0.1% |
| P | 1 | < 0.1% |
| O | 1 | < 0.1% |
| D | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 253953 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 74714 | |
| M | 52259 | |
| A | 52259 | |
| L | 52258 | |
| F | 22454 | 8.8% |
| H | 2 | < 0.1% |
| R | 2 | < 0.1% |
| P | 1 | < 0.1% |
| O | 1 | < 0.1% |
| D | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 253953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 74714 | |
| M | 52259 | |
| A | 52259 | |
| L | 52258 | |
| F | 22454 | 8.8% |
| H | 2 | < 0.1% |
| R | 2 | < 0.1% |
| P | 1 | < 0.1% |
| O | 1 | < 0.1% |
| D | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
lifeStage
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 542754 |
| Missing (%) | 92.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.744082805 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Larva |
|---|---|
| 2nd row | Egg |
| 3rd row | Larva |
| 4th row | Juvenile |
| 5th row | Juvenile |
| Value | Count | Frequency (%) |
| juvenile | 20321 | |
| larva | 11464 | |
| adult | 3710 | 9.0% |
| hatchling | 2380 | 5.7% |
| embryo | 1048 | 2.5% |
| egg | 838 | 2.0% |
| neonate | 656 | 1.6% |
| subadult | 528 | 1.3% |
| eft | 387 | 0.9% |
| immature | 88 | 0.2% |
| Other values (2) | 27 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 42069 | |
| v | 31785 | |
| l | 26962 | |
| a | 26603 | |
| u | 25179 | |
| n | 23357 | |
| i | 22701 | |
| J | 20321 | |
| r | 12600 | 4.5% |
| L | 11464 | 4.1% |
| Other values (20) | 36481 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 238075 | |
| Uppercase Letter | 41447 | 14.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 42069 | |
| v | 31785 | |
| l | 26962 | |
| a | 26603 | |
| u | 25179 | |
| n | 23357 | |
| i | 22701 | |
| r | 12600 | 5.3% |
| t | 7753 | 3.3% |
| d | 4261 | 1.8% |
| Other values (10) | 14805 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 20321 | |
| L | 11464 | |
| A | 3710 | 9.0% |
| H | 2380 | 5.7% |
| E | 2273 | 5.5% |
| N | 656 | 1.6% |
| S | 528 | 1.3% |
| I | 88 | 0.2% |
| T | 23 | 0.1% |
| F | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 279522 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 42069 | |
| v | 31785 | |
| l | 26962 | |
| a | 26603 | |
| u | 25179 | |
| n | 23357 | |
| i | 22701 | |
| J | 20321 | |
| r | 12600 | 4.5% |
| L | 11464 | 4.1% |
| Other values (20) | 36481 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 279522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 42069 | |
| v | 31785 | |
| l | 26962 | |
| a | 26603 | |
| u | 25179 | |
| n | 23357 | |
| i | 22701 | |
| J | 20321 | |
| r | 12600 | 4.5% |
| L | 11464 | 4.1% |
| Other values (20) | 36481 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1168402 | |
| P | 584201 | |
| R | 584201 | |
| S | 584201 | |
| N | 584201 | |
| T | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4089407 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1168402 | |
| P | 584201 | |
| R | 584201 | |
| S | 584201 | |
| N | 584201 | |
| T | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4089407 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1168402 | |
| P | 584201 | |
| R | 584201 | |
| S | 584201 | |
| N | 584201 | |
| T | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4089407 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1168402 | |
| P | 584201 | |
| R | 584201 | |
| S | 584201 | |
| N | 584201 | |
| T | 584201 |
preparations
Text
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5684 |
| Missing (%) | 1.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 7 |
| Mean length | 7.117061383 |
| Min length | 3 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Ethanol |
|---|---|
| 2nd row | Ethanol; Histological Material |
| 3rd row | Ethanol; Dry |
| 4th row | Ethanol |
| 5th row | Ethanol |
| Value | Count | Frequency (%) |
| ethanol | 553871 | |
| dry | 13058 | 2.2% |
| formalin | 8143 | 1.4% |
| cleared | 4474 | 0.8% |
| and | 4474 | 0.8% |
| stained | 4474 | 0.8% |
| histological | 2058 | 0.3% |
| material | 2058 | 0.3% |
| photograph | 126 | < 0.1% |
| sem | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| E | 553874 | |
| r | 27859 | 0.7% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| Other values (16) | 92885 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3511631 | |
| Uppercase Letter | 588271 | 14.3% |
| Space Separator | 14223 | 0.3% |
| Other Punctuation | 3216 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| r | 27859 | 0.8% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| d | 13422 | 0.4% |
| Other values (6) | 27627 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 553874 | |
| D | 13058 | 2.2% |
| F | 8143 | 1.4% |
| S | 4477 | 0.8% |
| C | 4474 | 0.8% |
| M | 2061 | 0.4% |
| H | 2058 | 0.3% |
| P | 126 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14223 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 3216 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4099902 | |
| Common | 17439 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| E | 553874 | |
| r | 27859 | 0.7% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| Other values (14) | 75446 | 1.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 14223 | |
| ; | 3216 | 18.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4117341 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| E | 553874 | |
| r | 27859 | 0.7% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| Other values (16) | 92885 | 2.3% |
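The preparations samples above appear to pack several preparation types into one string, separated by a semicolon (for example `Ethanol; Dry`). The following is a minimal Python sketch of how such values could be split and tallied. The function names `split_preparations` and `count_preparations` are hypothetical, the `;` separator is an assumption based on the sample rows, and the input list only repeats the five sample rows shown in this report.

```python
from collections import Counter


def split_preparations(value, sep=";"):
    """Split one preparations string into trimmed, non-empty parts.

    The separator is an assumption inferred from the sample rows.
    """
    if value is None:
        return []
    return [part.strip() for part in value.split(sep) if part.strip()]


def count_preparations(values):
    """Tally individual preparation types across many records."""
    counts = Counter()
    for value in values:
        counts.update(split_preparations(value))
    return counts


# The five sample rows listed in the report for this variable.
sample = [
    "Ethanol",
    "Ethanol; Histological Material",
    "Ethanol; Dry",
    "Ethanol",
    "Ethanol",
]
counts = count_preparations(sample)
print(counts.most_common())
```

Counting the split parts rather than the raw strings keeps a record such as `Ethanol; Dry` from being treated as a category of its own.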
associatedSequences
Text
Missing 
| Distinct | 719 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 583480 |
| Missing (%) | 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 699 |
|---|---|
| Median length | 99 |
| Mean length | 112.1983356 |
| Min length | 49 |
Unique
| Unique | 717 |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=AF199141;https://www.ncbi.nlm.nih.gov/gquery?term=AF199204 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=OM928184;https://www.ncbi.nlm.nih.gov/gquery?term=OM943246 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=JQ914700 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=FJ613461 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=FJ766602;https://www.ncbi.nlm.nih.gov/gquery?term=FJ784443 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn112709;https://www.ncbi.nlm.nih.gov/gquery?term=jn112771;https://www.ncbi.nlm.nih.gov/gquery?term=jn112642 | 2 | 0.3% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay604497 | 2 | 0.3% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj976636 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn377389;https://www.ncbi.nlm.nih.gov/gquery?term=jn377393;https://www.ncbi.nlm.nih.gov/gquery?term=jn377405 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc129216;https://www.ncbi.nlm.nih.gov/gquery?term=kc129324 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay604512 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj766829;https://www.ncbi.nlm.nih.gov/gquery?term=fj784465 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=om928184;https://www.ncbi.nlm.nih.gov/gquery?term=om943246 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jq914700 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj613461 | 1 | 0.1% |
| Other values (709) | 709 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6533 | 8.1% |
| t | 4896 | 6.1% |
| / | 4896 | 6.1% |
| w | 4896 | 6.1% |
| n | 4896 | 6.1% |
| h | 3264 | 4.0% |
| r | 3264 | 4.0% |
| i | 3264 | 4.0% |
| e | 3264 | 4.0% |
| m | 3264 | 4.0% |
| Other values (45) | 38458 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 50592 | |
| Other Punctuation | 15604 | 19.3% |
| Decimal Number | 9801 | 12.1% |
| Uppercase Letter | 3266 | 4.0% |
| Math Symbol | 1632 | 2.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 653 | |
| F | 619 | |
| M | 450 | |
| K | 434 | |
| A | 177 | 5.4% |
| Y | 150 | 4.6% |
| Q | 129 | 3.9% |
| H | 104 | 3.2% |
| N | 86 | 2.6% |
| O | 76 | 2.3% |
| Other values (10) | 388 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4896 | 9.7% |
| w | 4896 | 9.7% |
| n | 4896 | 9.7% |
| h | 3264 | 6.5% |
| r | 3264 | 6.5% |
| i | 3264 | 6.5% |
| e | 3264 | 6.5% |
| m | 3264 | 6.5% |
| g | 3264 | 6.5% |
| q | 1632 | 3.2% |
| Other values (9) | 14688 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1506 | |
| 6 | 1267 | |
| 7 | 1220 | |
| 8 | 1087 | |
| 3 | 960 | |
| 1 | 840 | |
| 5 | 786 | |
| 2 | 763 | |
| 9 | 763 | |
| 0 | 609 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6533 | |
| / | 4896 | |
| ? | 1632 | 10.5% |
| : | 1632 | 10.5% |
| ; | 911 | 5.8% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1632 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53858 | |
| Common | 27037 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4896 | 9.1% |
| w | 4896 | 9.1% |
| n | 4896 | 9.1% |
| h | 3264 | 6.1% |
| r | 3264 | 6.1% |
| i | 3264 | 6.1% |
| e | 3264 | 6.1% |
| m | 3264 | 6.1% |
| g | 3264 | 6.1% |
| q | 1632 | 3.0% |
| Other values (29) | 17954 |
Common
| Value | Count | Frequency (%) |
| . | 6533 | |
| / | 4896 | |
| = | 1632 | 6.0% |
| ? | 1632 | 6.0% |
| : | 1632 | 6.0% |
| 4 | 1506 | 5.6% |
| 6 | 1267 | 4.7% |
| 7 | 1220 | 4.5% |
| 8 | 1087 | 4.0% |
| 3 | 960 | 3.6% |
| Other values (6) | 4672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6533 | 8.1% |
| t | 4896 | 6.1% |
| / | 4896 | 6.1% |
| w | 4896 | 6.1% |
| n | 4896 | 6.1% |
| h | 3264 | 4.0% |
| r | 3264 | 4.0% |
| i | 3264 | 4.0% |
| e | 3264 | 4.0% |
| m | 3264 | 4.0% |
| Other values (45) | 38458 |
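The associatedSequences samples above are semicolon-separated NCBI search URLs whose `term` query parameter carries what looks like a sequence accession. The following is a minimal Python sketch, using only the standard library, of how those identifiers could be pulled out of one cell. The function name `extract_accessions` is hypothetical, and both the `;` separator and the reading of `term` as an accession are assumptions based on the sample rows.

```python
from urllib.parse import urlparse, parse_qs


def extract_accessions(value, sep=";"):
    """Return the `term` query values from a separator-joined list of URLs.

    Assumes each URL looks like the samples in this report,
    e.g. https://www.ncbi.nlm.nih.gov/gquery?term=AF199141
    """
    accessions = []
    if not value:
        return accessions
    for url in value.split(sep):
        url = url.strip()
        if not url:
            continue
        # parse_qs maps each query key to a list of its values.
        query = parse_qs(urlparse(url).query)
        accessions.extend(query.get("term", []))
    return accessions


# First sample row listed in the report for this variable.
row = (
    "https://www.ncbi.nlm.nih.gov/gquery?term=AF199141;"
    "https://www.ncbi.nlm.nih.gov/gquery?term=AF199204"
)
print(extract_accessions(row))
```

Splitting on the separator before parsing keeps each URL intact, so the query string of one link is never merged with the next.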
occurrenceRemarks
Text
Missing 
| Distinct | 5339 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 557618 |
| Missing (%) | 95.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 1294 |
|---|---|
| Median length | 381 |
| Mean length | 66.70947598 |
| Min length | 3 |
Unique
| Unique | 3351 |
|---|---|
| Unique (%) | 12.6% |
Sample
| 1st row | Collected from vegetation removal plot (Cocolob 2) in coastal strand Cocolobo uvifera forest, ca. 10 m inland from beach. |
|---|---|
| 2nd row | Collected in roadside ditch in gum/bay swamp. Water depth: 10-40 cm. |
| 3rd row | Complete clutch of eggs removed from the ovaries of a female (Total Length: 57 inches) collected along wooded road. |
| 4th row | Collected on surface at night. |
| 5th row | Collected above and below the falls, south of the creek. |
| Value | Count | Frequency (%) |
| collected | 21028 | 7.1% |
| in | 15429 | 5.2% |
| of | 11658 | 3.9% |
| the | 11088 | 3.7% |
| on | 10611 | 3.6% |
| from | 7596 | 2.6% |
| and | 5597 | 1.9% |
| at | 5284 | 1.8% |
| area | 4127 | 1.4% |
| road | 4049 | 1.4% |
| Other values (6088) | 200792 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 270676 | |
| e | 160711 | 9.1% |
| o | 140496 | 7.9% |
| a | 114427 | 6.5% |
| t | 108900 | 6.1% |
| l | 98347 | 5.5% |
| n | 89158 | 5.0% |
| r | 81415 | 4.6% |
| d | 76949 | 4.3% |
| i | 72320 | 4.1% |
| Other values (81) | 559939 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1313680 | |
| Space Separator | 270676 | 15.3% |
| Uppercase Letter | 64926 | 3.7% |
| Decimal Number | 55444 | 3.1% |
| Other Punctuation | 51683 | 2.9% |
| Open Punctuation | 5630 | 0.3% |
| Close Punctuation | 5620 | 0.3% |
| Dash Punctuation | 5566 | 0.3% |
| Math Symbol | 113 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 160711 | |
| o | 140496 | |
| a | 114427 | 8.7% |
| t | 108900 | 8.3% |
| l | 98347 | 7.5% |
| n | 89158 | 6.8% |
| r | 81415 | 6.2% |
| d | 76949 | 5.9% |
| i | 72320 | 5.5% |
| s | 64649 | 4.9% |
| Other values (23) | 306308 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 24437 | |
| P | 4884 | 7.5% |
| N | 3946 | 6.1% |
| A | 3756 | 5.8% |
| S | 3750 | 5.8% |
| T | 2957 | 4.6% |
| R | 2957 | 4.6% |
| M | 2263 | 3.5% |
| F | 1854 | 2.9% |
| H | 1826 | 2.8% |
| Other values (16) | 12296 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 35953 | |
| , | 7909 | 15.3% |
| : | 3238 | 6.3% |
| " | 1687 | 3.3% |
| ; | 1332 | 2.6% |
| ' | 566 | 1.1% |
| / | 486 | 0.9% |
| % | 225 | 0.4% |
| # | 199 | 0.4% |
| ? | 57 | 0.1% |
| Other values (2) | 31 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 12010 | |
| 0 | 9775 | |
| 2 | 7516 | |
| 9 | 5201 | |
| 8 | 4032 | 7.3% |
| 5 | 3818 | 6.9% |
| 3 | 3764 | 6.8% |
| 7 | 3390 | 6.1% |
| 6 | 3356 | 6.1% |
| 4 | 2582 | 4.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 100 | |
| + | 7 | 6.2% |
| < | 4 | 3.5% |
| > | 2 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5543 | |
| [ | 87 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5533 | |
| ] | 87 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 270676 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5566 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1378606 | |
| Common | 394732 | 22.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 160711 | |
| o | 140496 | 10.2% |
| a | 114427 | 8.3% |
| t | 108900 | 7.9% |
| l | 98347 | 7.1% |
| n | 89158 | 6.5% |
| r | 81415 | 5.9% |
| d | 76949 | 5.6% |
| i | 72320 | 5.2% |
| s | 64649 | 4.7% |
| Other values (49) | 371234 |
Common
| Value | Count | Frequency (%) |
| (space) | 270676 | |
| . | 35953 | 9.1% |
| 1 | 12010 | 3.0% |
| 0 | 9775 | 2.5% |
| , | 7909 | 2.0% |
| 2 | 7516 | 1.9% |
| - | 5566 | 1.4% |
| ( | 5543 | 1.4% |
| ) | 5533 | 1.4% |
| 9 | 5201 | 1.3% |
| Other values (22) | 29050 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1773305 | |
| None | 33 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 270676 | |
| e | 160711 | 9.1% |
| o | 140496 | 7.9% |
| a | 114427 | 6.5% |
| t | 108900 | 6.1% |
| l | 98347 | 5.5% |
| n | 89158 | 5.0% |
| r | 81415 | 4.6% |
| d | 76949 | 4.3% |
| i | 72320 | 4.1% |
| Other values (74) | 559906 |
None
| Value | Count | Frequency (%) |
| ö | 14 | |
| á | 7 | |
| é | 5 | 15.2% |
| ó | 2 | 6.1% |
| ü | 2 | 6.1% |
| è | 2 | 6.1% |
| ñ | 1 | 3.0% |
fieldNumber
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 584193 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.125 |
| Min length | 6 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 12.5% |
Sample
| 1st row | 83-012 |
|---|---|
| 2nd row | 83-012 |
| 3rd row | 83-012 |
| 4th row | 83-012 |
| 5th row | 83-012 |
| Value | Count | Frequency (%) |
| 83-012 | 7 | |
| 83-024a | 1 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| - | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.0% |
| A | 1 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40 | |
| Dash Punctuation | 8 | 16.3% |
| Uppercase Letter | 1 | 2.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 48 | |
| Latin | 1 | 2.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| - | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.1% |
Latin
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| - | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.0% |
| A | 1 | 2.0% |
eventDate
Text
Missing 
| Distinct | 31039 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 39140 |
| Missing (%) | 6.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.940764428 |
| Min length | 4 |
Unique
| Unique | 7117 |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 1972-02-01/1972-02-03 |
|---|---|
| 2nd row | 1971-09-03 |
| 3rd row | 1992-10-15 |
| 4th row | 1992-06-24 |
| 5th row | 1998-09-03 |
| Value | Count | Frequency (%) |
| 1883 | 739 | 0.1% |
| 1973-09-22 | 723 | 0.1% |
| 1935 | 701 | 0.1% |
| 1998-10-09 | 690 | 0.1% |
| 1971-08-16 | 610 | 0.1% |
| 1940 | 598 | 0.1% |
| 1966-04-11 | 579 | 0.1% |
| 1970-06-19 | 564 | 0.1% |
| 1976-10-03 | 540 | 0.1% |
| 1971-07-31 | 521 | 0.1% |
| Other values (31029) | 538796 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1054903 | |
| 1 | 987143 | |
| 0 | 809873 | |
| 9 | 731463 | |
| 2 | 355880 | 6.6% |
| 7 | 294421 | 5.4% |
| 6 | 287515 | 5.3% |
| 8 | 286448 | 5.3% |
| 3 | 210910 | 3.9% |
| 5 | 208423 | 3.8% |
| Other values (2) | 191344 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4348746 | |
| Dash Punctuation | 1054903 | 19.5% |
| Other Punctuation | 14674 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 987143 | |
| 0 | 809873 | |
| 9 | 731463 | |
| 2 | 355880 | 8.2% |
| 7 | 294421 | 6.8% |
| 6 | 287515 | 6.6% |
| 8 | 286448 | 6.6% |
| 3 | 210910 | 4.8% |
| 5 | 208423 | 4.8% |
| 4 | 176670 | 4.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1054903 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 14674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5418323 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1054903 | |
| 1 | 987143 | |
| 0 | 809873 | |
| 9 | 731463 | |
| 2 | 355880 | 6.6% |
| 7 | 294421 | 5.4% |
| 6 | 287515 | 5.3% |
| 8 | 286448 | 5.3% |
| 3 | 210910 | 3.9% |
| 5 | 208423 | 3.8% |
| Other values (2) | 191344 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5418323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1054903 | |
| 1 | 987143 | |
| 0 | 809873 | |
| 9 | 731463 | |
| 2 | 355880 | 6.6% |
| 7 | 294421 | 5.4% |
| 6 | 287515 | 5.3% |
| 8 | 286448 | 5.3% |
| 3 | 210910 | 3.9% |
| 5 | 208423 | 3.8% |
| Other values (2) | 191344 | 3.5% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 86170 |
| Missing (%) | 14.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.783003468 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 32 |
|---|---|
| 2nd row | 246 |
| 3rd row | 289 |
| 4th row | 176 |
| 5th row | 246 |
| Value | Count | Frequency (%) |
| 227 | 2917 | 0.6% |
| 230 | 2852 | 0.6% |
| 233 | 2687 | 0.5% |
| 196 | 2660 | 0.5% |
| 210 | 2604 | 0.5% |
| 232 | 2592 | 0.5% |
| 145 | 2504 | 0.5% |
| 106 | 2489 | 0.5% |
| 228 | 2467 | 0.5% |
| 209 | 2408 | 0.5% |
| Other values (356) | 471851 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 290344 | |
| 2 | 266757 | |
| 3 | 147664 | |
| 0 | 99736 | 7.2% |
| 4 | 99682 | 7.2% |
| 8 | 97615 | 7.0% |
| 6 | 97257 | 7.0% |
| 9 | 96182 | 6.9% |
| 7 | 95676 | 6.9% |
| 5 | 95109 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1386022 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 290344 | |
| 2 | 266757 | |
| 3 | 147664 | |
| 0 | 99736 | 7.2% |
| 4 | 99682 | 7.2% |
| 8 | 97615 | 7.0% |
| 6 | 97257 | 7.0% |
| 9 | 96182 | 6.9% |
| 7 | 95676 | 6.9% |
| 5 | 95109 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1386022 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 290344 | |
| 2 | 266757 | |
| 3 | 147664 | |
| 0 | 99736 | 7.2% |
| 4 | 99682 | 7.2% |
| 8 | 97615 | 7.0% |
| 6 | 97257 | 7.0% |
| 9 | 96182 | 6.9% |
| 7 | 95676 | 6.9% |
| 5 | 95109 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1386022 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 290344 | |
| 2 | 266757 | |
| 3 | 147664 | |
| 0 | 99736 | 7.2% |
| 4 | 99682 | 7.2% |
| 8 | 97615 | 7.0% |
| 6 | 97257 | 7.0% |
| 9 | 96182 | 6.9% |
| 7 | 95676 | 6.9% |
| 5 | 95109 | 6.9% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 86170 |
| Missing (%) | 14.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.783690172 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 34 |
|---|---|
| 2nd row | 246 |
| 3rd row | 289 |
| 4th row | 176 |
| 5th row | 246 |
| Value | Count | Frequency (%) |
| 230 | 3038 | 0.6% |
| 227 | 2924 | 0.6% |
| 233 | 2713 | 0.5% |
| 196 | 2664 | 0.5% |
| 210 | 2658 | 0.5% |
| 232 | 2593 | 0.5% |
| 145 | 2544 | 0.5% |
| 226 | 2520 | 0.5% |
| 228 | 2516 | 0.5% |
| 209 | 2373 | 0.5% |
| Other values (356) | 471488 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 291087 | |
| 2 | 266673 | |
| 3 | 148129 | |
| 0 | 99590 | 7.2% |
| 4 | 99486 | 7.2% |
| 8 | 98001 | 7.1% |
| 9 | 96639 | 7.0% |
| 6 | 96151 | 6.9% |
| 5 | 95612 | 6.9% |
| 7 | 94996 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1386364 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 291087 | |
| 2 | 266673 | |
| 3 | 148129 | |
| 0 | 99590 | 7.2% |
| 4 | 99486 | 7.2% |
| 8 | 98001 | 7.1% |
| 9 | 96639 | 7.0% |
| 6 | 96151 | 6.9% |
| 5 | 95612 | 6.9% |
| 7 | 94996 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1386364 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 291087 | |
| 2 | 266673 | |
| 3 | 148129 | |
| 0 | 99590 | 7.2% |
| 4 | 99486 | 7.2% |
| 8 | 98001 | 7.1% |
| 9 | 96639 | 7.0% |
| 6 | 96151 | 6.9% |
| 5 | 95612 | 6.9% |
| 7 | 94996 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1386364 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 291087 | |
| 2 | 266673 | |
| 3 | 148129 | |
| 0 | 99590 | 7.2% |
| 4 | 99486 | 7.2% |
| 8 | 98001 | 7.1% |
| 9 | 96639 | 7.0% |
| 6 | 96151 | 6.9% |
| 5 | 95612 | 6.9% |
| 7 | 94996 | 6.9% |
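The sample rows for eventDate, startDayOfYear and endDayOfYear line up: the interval `1972-02-01/1972-02-03` corresponds to days 32 and 34, and `1971-09-03` to day 246 in both columns. The following is a minimal Python sketch of that relationship, assuming eventDate holds either a full ISO 8601 date or a `start/end` interval of full dates. The function name `day_of_year_range` is hypothetical, and how the published columns were actually derived is not stated in this report.

```python
import re
from datetime import date

# Only full calendar dates (YYYY-MM-DD) carry a day of the year.
FULL_DATE = re.compile(r"^\d{4}-\d{2}-\d{2}$")


def day_of_year_range(event_date):
    """Return (start_day, end_day) for a full date or a date interval.

    Returns None for missing values and for reduced-precision values
    such as a bare year, which have no day of the year.
    """
    if not event_date:
        return None
    parts = event_date.split("/")
    if len(parts) > 2 or not all(FULL_DATE.match(p) for p in parts):
        return None
    try:
        days = [date.fromisoformat(p).timetuple().tm_yday for p in parts]
    except ValueError:
        # Well-formed pattern but not a real calendar date.
        return None
    return days[0], days[-1]


# The first two eventDate sample rows, plus a year-only value.
for value in ["1972-02-01/1972-02-03", "1971-09-03", "1883"]:
    print(value, day_of_year_range(value))
```

Year-only values such as `1883` return `None` here, which fits the day-of-year columns having more missing values (86170) than eventDate (39140).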
year
Text
Missing 
| Distinct | 184 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 39600 |
| Missing (%) | 6.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1972 |
|---|---|
| 2nd row | 1971 |
| 3rd row | 1992 |
| 4th row | 1992 |
| 5th row | 1998 |
| Value | Count | Frequency (%) |
| 1971 | 16999 | 3.1% |
| 1966 | 15984 | 2.9% |
| 1969 | 15769 | 2.9% |
| 1970 | 15631 | 2.9% |
| 1976 | 15292 | 2.8% |
| 1980 | 15179 | 2.8% |
| 1979 | 14958 | 2.7% |
| 1972 | 14412 | 2.6% |
| 1961 | 12797 | 2.3% |
| 1984 | 12646 | 2.3% |
| Other values (174) | 394934 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 627645 | |
| 1 | 599705 | |
| 7 | 176112 | 8.1% |
| 6 | 174293 | 8.0% |
| 8 | 162231 | 7.4% |
| 0 | 112827 | 5.2% |
| 2 | 89669 | 4.1% |
| 5 | 85387 | 3.9% |
| 3 | 81823 | 3.8% |
| 4 | 68712 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2178404 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 627645 | |
| 1 | 599705 | |
| 7 | 176112 | 8.1% |
| 6 | 174293 | 8.0% |
| 8 | 162231 | 7.4% |
| 0 | 112827 | 5.2% |
| 2 | 89669 | 4.1% |
| 5 | 85387 | 3.9% |
| 3 | 81823 | 3.8% |
| 4 | 68712 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2178404 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 627645 | |
| 1 | 599705 | |
| 7 | 176112 | 8.1% |
| 6 | 174293 | 8.0% |
| 8 | 162231 | 7.4% |
| 0 | 112827 | 5.2% |
| 2 | 89669 | 4.1% |
| 5 | 85387 | 3.9% |
| 3 | 81823 | 3.8% |
| 4 | 68712 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2178404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 627645 | |
| 1 | 599705 | |
| 7 | 176112 | 8.1% |
| 6 | 174293 | 8.0% |
| 8 | 162231 | 7.4% |
| 0 | 112827 | 5.2% |
| 2 | 89669 | 4.1% |
| 5 | 85387 | 3.9% |
| 3 | 81823 | 3.8% |
| 4 | 68712 | 3.2% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 59025 |
| Missing (%) | 10.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.163293829 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 9 |
| 3rd row | 10 |
| 4th row | 6 |
| 5th row | 9 |
| Value | Count | Frequency (%) |
| 8 | 67450 | |
| 5 | 63954 | |
| 7 | 63917 | |
| 6 | 59064 | |
| 4 | 55219 | |
| 3 | 46402 | |
| 10 | 42862 | |
| 9 | 36546 | |
| 11 | 25432 | 4.8% |
| 2 | 25273 | 4.8% |
| Other values (2) | 39057 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 132783 | |
| 8 | 67450 | |
| 5 | 63954 | |
| 7 | 63917 | |
| 6 | 59064 | |
| 4 | 55219 | |
| 3 | 46402 | 7.6% |
| 0 | 42862 | 7.0% |
| 2 | 42737 | 7.0% |
| 9 | 36546 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 610934 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 132783 | |
| 8 | 67450 | |
| 5 | 63954 | |
| 7 | 63917 | |
| 6 | 59064 | |
| 4 | 55219 | |
| 3 | 46402 | 7.6% |
| 0 | 42862 | 7.0% |
| 2 | 42737 | 7.0% |
| 9 | 36546 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 610934 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 132783 | |
| 8 | 67450 | |
| 5 | 63954 | |
| 7 | 63917 | |
| 6 | 59064 | |
| 4 | 55219 | |
| 3 | 46402 | 7.6% |
| 0 | 42862 | 7.0% |
| 2 | 42737 | 7.0% |
| 9 | 36546 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 610934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 132783 | |
| 8 | 67450 | |
| 5 | 63954 | |
| 7 | 63917 | |
| 6 | 59064 | |
| 4 | 55219 | |
| 3 | 46402 | 7.6% |
| 0 | 42862 | 7.0% |
| 2 | 42737 | 7.0% |
| 9 | 36546 | 6.0% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 100844 |
| Missing (%) | 17.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.71638768 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 15 |
| 3rd row | 24 |
| 4th row | 3 |
| 5th row | 29 |
| Value | Count | Frequency (%) |
| 15 | 18907 | 3.9% |
| 13 | 17259 | 3.6% |
| 21 | 17015 | 3.5% |
| 25 | 16944 | 3.5% |
| 19 | 16843 | 3.5% |
| 24 | 16667 | 3.4% |
| 16 | 16371 | 3.4% |
| 3 | 16363 | 3.4% |
| 22 | 16276 | 3.4% |
| 28 | 16272 | 3.4% |
| Other values (21) | 314440 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 220646 | |
| 2 | 206016 | |
| 3 | 73256 | 8.8% |
| 5 | 51604 | 6.2% |
| 8 | 47441 | 5.7% |
| 9 | 46929 | 5.7% |
| 0 | 46463 | 5.6% |
| 4 | 46187 | 5.6% |
| 6 | 45993 | 5.5% |
| 7 | 45093 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 829628 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 220646 | |
| 2 | 206016 | |
| 3 | 73256 | 8.8% |
| 5 | 51604 | 6.2% |
| 8 | 47441 | 5.7% |
| 9 | 46929 | 5.7% |
| 0 | 46463 | 5.6% |
| 4 | 46187 | 5.6% |
| 6 | 45993 | 5.5% |
| 7 | 45093 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 829628 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 220646 | |
| 2 | 206016 | |
| 3 | 73256 | 8.8% |
| 5 | 51604 | 6.2% |
| 8 | 47441 | 5.7% |
| 9 | 46929 | 5.7% |
| 0 | 46463 | 5.6% |
| 4 | 46187 | 5.6% |
| 6 | 45993 | 5.5% |
| 7 | 45093 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 829628 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 220646 | |
| 2 | 206016 | |
| 3 | 73256 | 8.8% |
| 5 | 51604 | 6.2% |
| 8 | 47441 | 5.7% |
| 9 | 46929 | 5.7% |
| 0 | 46463 | 5.6% |
| 4 | 46187 | 5.6% |
| 6 | 45993 | 5.5% |
| 7 | 45093 | 5.4% |
verbatimEventDate
Text
| Distinct | 42558 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 51 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 194 |
|---|---|
| Median length | 11 |
| Mean length | 12.14387743 |
| Min length | 4 |
Unique
| Unique | 14192 |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | 01-03 February 1972 |
|---|---|
| 2nd row | 3 Sep 1971 |
| 3rd row | -- --- ---- |
| 4th row | 15 Oct 1992; 09:05-13:00 hrs |
| 5th row | 24 Jun 1992; 10:30-11:40 hrs |
| Value | Count | Frequency (%) |
| (blank) | 173374 | 9.4% |
| may | 65316 | 3.5% |
| aug | 63760 | 3.5% |
| jul | 58386 | 3.2% |
| jun | 53770 | 2.9% |
| apr | 50984 | 2.8% |
| mar | 43098 | 2.3% |
| oct | 40349 | 2.2% |
| sep | 34295 | 1.9% |
| hrs | 24306 | 1.3% |
| Other values (3264) | 1238022 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1261510 | |
| 1 | 874532 | 12.3% |
| 9 | 688315 | 9.7% |
| - | 499756 | 7.0% |
| 2 | 328876 | 4.6% |
| 0 | 243409 | 3.4% |
| 6 | 227222 | 3.2% |
| 7 | 227024 | 3.2% |
| 8 | 217953 | 3.1% |
| u | 208644 | 2.9% |
| Other values (64) | 2316605 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3263423 | |
| Lowercase Letter | 1431190 | |
| Space Separator | 1261510 | 17.8% |
| Uppercase Letter | 543907 | 7.7% |
| Dash Punctuation | 499756 | 7.0% |
| Other Punctuation | 92897 | 1.3% |
| Open Punctuation | 581 | < 0.1% |
| Close Punctuation | 581 | < 0.1% |
| Format | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 208644 | |
| r | 158708 | |
| a | 157043 | |
| e | 120723 | |
| n | 97727 | 6.8% |
| p | 94749 | 6.6% |
| y | 81426 | 5.7% |
| l | 78861 | 5.5% |
| g | 78230 | 5.5% |
| c | 71344 | 5.0% |
| Other values (16) | 283735 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 147803 | |
| A | 124847 | |
| M | 113882 | |
| O | 43248 | 8.0% |
| S | 39048 | 7.2% |
| F | 26562 | 4.9% |
| N | 26070 | 4.8% |
| D | 18361 | 3.4% |
| C | 3404 | 0.6% |
| E | 144 | < 0.1% |
| Other values (13) | 538 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 874532 | |
| 9 | 688315 | |
| 2 | 328876 | 10.1% |
| 0 | 243409 | 7.5% |
| 6 | 227222 | 7.0% |
| 7 | 227024 | 7.0% |
| 8 | 217953 | 6.7% |
| 3 | 176664 | 5.4% |
| 5 | 152254 | 4.7% |
| 4 | 127174 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 41924 | |
| ; | 34825 | |
| . | 14987 | 16.1% |
| , | 770 | 0.8% |
| / | 307 | 0.3% |
| ' | 46 | < 0.1% |
| " | 20 | < 0.1% |
| ? | 18 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 580 | |
| [ | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 580 | |
| ] | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1261510 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 499756 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5118749 | |
| Latin | 1975097 | 27.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 208644 | 10.6% |
| r | 158708 | 8.0% |
| a | 157043 | 8.0% |
| J | 147803 | 7.5% |
| A | 124847 | 6.3% |
| e | 120723 | 6.1% |
| M | 113882 | 5.8% |
| n | 97727 | 4.9% |
| p | 94749 | 4.8% |
| y | 81426 | 4.1% |
| Other values (39) | 669545 |
Common
| Value | Count | Frequency (%) |
| (space) | 1261510 | |
| 1 | 874532 | |
| 9 | 688315 | |
| - | 499756 | 9.8% |
| 2 | 328876 | 6.4% |
| 0 | 243409 | 4.8% |
| 6 | 227222 | 4.4% |
| 7 | 227024 | 4.4% |
| 8 | 217953 | 4.3% |
| 3 | 176664 | 3.5% |
| Other values (15) | 373488 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7093845 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1261510 | |
| 1 | 874532 | 12.3% |
| 9 | 688315 | 9.7% |
| - | 499756 | 7.0% |
| 2 | 328876 | 4.6% |
| 0 | 243409 | 3.4% |
| 6 | 227222 | 3.2% |
| 7 | 227024 | 3.2% |
| 8 | 217953 | 3.1% |
| u | 208644 | 2.9% |
| Other values (63) | 2316604 |
None
| Value | Count | Frequency (%) |
| | 1 |
higherGeography
Text
| Distinct | 6286 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 4414 |
| Missing (%) | 0.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 167 |
|---|---|
| Median length | 118 |
| Mean length | 48.81643259 |
| Min length | 4 |
Unique
| Unique | 1092 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Oceania, Papua New Guinea, Central Province, Kairuku-Hiri District, New Guinea |
|---|---|
| 2nd row | North America, United States, North Carolina, Buncombe - Yancey |
| 3rd row | Oceania, Pacific Ocean , Tonga, Tonga Islands, Tongatapu Island Group, Tonga Islands |
| 4th row | North America, Grenada, St. George Parish, Lesser Antilles, Windward Islands, Grenada Island |
| 5th row | North America, United States, Virginia, Augusta |
| Value | Count | Frequency (%) |
| america | 483266 | 12.9% |
| north | 476209 | 12.7% |
| states | 351020 | 9.4% |
| united | 349359 | 9.4% |
| virginia | 96173 | 2.6% |
| south | 71896 | 1.9% |
| islands | 71471 | 1.9% |
| carolina | 61728 | 1.7% |
| (blank) | 54664 | 1.5% |
| asia | 39306 | 1.1% |
| Other values (4622) | 1680221 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3155526 | 11.1% |
| a | 2668328 | 9.4% |
| i | 2176293 | 7.7% |
| e | 2119779 | 7.5% |
| t | 1973350 | 7.0% |
| r | 1844062 | 6.5% |
| , | 1669519 | 5.9% |
| n | 1511861 | 5.3% |
| o | 1298349 | 4.6% |
| s | 1011828 | 3.6% |
| Other values (73) | 8874238 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19740985 | |
| Uppercase Letter | 3656252 | 12.9% |
| Space Separator | 3155526 | 11.1% |
| Other Punctuation | 1685470 | 6.0% |
| Dash Punctuation | 42316 | 0.1% |
| Open Punctuation | 11057 | < 0.1% |
| Close Punctuation | 11052 | < 0.1% |
| Math Symbol | 409 | < 0.1% |
| Decimal Number | 64 | < 0.1% |
| Modifier Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2668328 | |
| i | 2176293 | |
| e | 2119779 | |
| t | 1973350 | |
| r | 1844062 | |
| n | 1511861 | |
| o | 1298349 | 6.6% |
| s | 1011828 | 5.1% |
| c | 896985 | 4.5% |
| h | 740667 | 3.8% |
| Other values (28) | 3499483 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 644681 | |
| N | 528051 | |
| S | 527098 | |
| U | 359922 | |
| P | 226035 | 6.2% |
| C | 185016 | 5.1% |
| M | 170358 | 4.7% |
| I | 135691 | 3.7% |
| V | 116151 | 3.2% |
| G | 115778 | 3.2% |
| Other values (18) | 647471 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1669519 | |
| . | 13671 | 0.8% |
| ' | 2228 | 0.1% |
| ? | 41 | < 0.1% |
| / | 11 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42077 | |
| – | 239 | 0.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10381 | |
| [ | 676 | 6.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10376 | |
| ] | 676 | 6.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 389 | |
| + | 20 | 4.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 32 | |
| 0 | 32 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3155526 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23397237 | |
| Common | 4905896 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2668328 | 11.4% |
| i | 2176293 | 9.3% |
| e | 2119779 | 9.1% |
| t | 1973350 | 8.4% |
| r | 1844062 | 7.9% |
| n | 1511861 | 6.5% |
| o | 1298349 | 5.5% |
| s | 1011828 | 4.3% |
| c | 896985 | 3.8% |
| h | 740667 | 3.2% |
| Other values (56) | 7155735 |
Common
| Value | Count | Frequency (%) |
| (space) | 3155526 | |
| , | 1669519 | |
| - | 42077 | 0.9% |
| . | 13671 | 0.3% |
| ( | 10381 | 0.2% |
| ) | 10376 | 0.2% |
| ' | 2228 | < 0.1% |
| [ | 676 | < 0.1% |
| ] | 676 | < 0.1% |
| = | 389 | < 0.1% |
| Other values (7) | 377 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28276056 | |
| None | 26786 | 0.1% |
| Punctuation | 239 | < 0.1% |
| Latin Ext Additional | 50 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3155526 | 11.2% |
| a | 2668328 | 9.4% |
| i | 2176293 | 7.7% |
| e | 2119779 | 7.5% |
| t | 1973350 | 7.0% |
| r | 1844062 | 6.5% |
| , | 1669519 | 5.9% |
| n | 1511861 | 5.3% |
| o | 1298349 | 4.6% |
| s | 1011828 | 3.6% |
| Other values (57) | 8847161 |
None
| Value | Count | Frequency (%) |
| é | 6953 | |
| á | 5925 | |
| ã | 4537 | |
| í | 4305 | |
| ó | 3223 | |
| ô | 1182 | 4.4% |
| ñ | 439 | 1.6% |
| â | 51 | 0.2% |
| Đ | 50 | 0.2% |
| ı | 48 | 0.2% |
| Other values (3) | 73 | 0.3% |
Punctuation
| Value | Count | Frequency (%) |
| – | 239 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 50 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 2 |
continent
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10069 |
| Missing (%) | 1.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.7863662 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OCEANIA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | OCEANIA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 416962 | |
| south_america | 64731 | 11.3% |
| asia | 39723 | 6.9% |
| oceania | 29733 | 5.2% |
| africa | 20601 | 3.6% |
| europe | 2382 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1143500 | |
| R | 921638 | |
| I | 571750 | |
| C | 532027 | |
| E | 516190 | |
| O | 513808 | |
| T | 481693 | |
| H | 481693 | |
| _ | 481693 | |
| M | 481693 | |
| Other values (5) | 641245 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6285237 | |
| Connector Punctuation | 481693 | 7.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1143500 | |
| R | 921638 | |
| I | 571750 | |
| C | 532027 | |
| E | 516190 | |
| O | 513808 | |
| T | 481693 | |
| H | 481693 | |
| M | 481693 | |
| N | 446695 | 7.1% |
| Other values (4) | 194550 | 3.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 481693 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6285237 | |
| Common | 481693 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1143500 | |
| R | 921638 | |
| I | 571750 | |
| C | 532027 | |
| E | 516190 | |
| O | 513808 | |
| T | 481693 | |
| H | 481693 | |
| M | 481693 | |
| N | 446695 | 7.1% |
| Other values (4) | 194550 | 3.1% |
Common
| Value | Count | Frequency (%) |
| _ | 481693 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6766930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1143500 | |
| R | 921638 | |
| I | 571750 | |
| C | 532027 | |
| E | 516190 | |
| O | 513808 | |
| T | 481693 | |
| H | 481693 | |
| _ | 481693 | |
| M | 481693 | |
| Other values (5) | 641245 |
waterBody
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 555994 |
| Missing (%) | 95.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 12.96972383 |
| Min length | 12 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pacific Ocean |
|---|---|
| 2nd row | Pacific Ocean |
| 3rd row | Pacific Ocean |
| 4th row | Pacific Ocean |
| 5th row | Indian Ocean |
| Value | Count | Frequency (%) |
| ocean | 28207 | |
| pacific | 26665 | |
| indian | 1198 | 2.1% |
| atlantic | 344 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 8.5% |
| (space) | 28207 | 7.7% |
| O | 28207 | 7.7% |
| e | 28207 | 7.7% |
| P | 26665 | 7.3% |
| f | 26665 | 7.3% |
| I | 1198 | 0.3% |
| Other values (4) | 2574 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 281216 | |
| Uppercase Letter | 56414 | 15.4% |
| Space Separator | 28207 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 11.0% |
| e | 28207 | 10.0% |
| f | 26665 | 9.5% |
| d | 1198 | 0.4% |
| t | 688 | 0.2% |
| l | 344 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 28207 | |
| P | 26665 | |
| I | 1198 | 2.1% |
| A | 344 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 28207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 337630 | |
| Common | 28207 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 9.2% |
| O | 28207 | 8.4% |
| e | 28207 | 8.4% |
| P | 26665 | 7.9% |
| f | 26665 | 7.9% |
| I | 1198 | 0.4% |
| d | 1198 | 0.4% |
| Other values (3) | 1376 | 0.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 28207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 365837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 8.5% |
| (space) | 28207 | 7.7% |
| O | 28207 | 7.7% |
| e | 28207 | 7.7% |
| P | 26665 | 7.3% |
| f | 26665 | 7.3% |
| I | 1198 | 0.3% |
| Other values (4) | 2574 | 0.7% |
islandGroup
Text
Missing 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 564324 |
| Missing (%) | 96.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 25 |
| Mean length | 13.3327967 |
| Min length | 10 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Windward Islands |
|---|---|
| 2nd row | Virgin Islands |
| 3rd row | Hispaniola |
| 4th row | Hispaniola |
| 5th row | Greater Sunda Islands |
| Value | Count | Frequency (%) |
| islands | 10225 | |
| hispaniola | 8927 | |
| virgin | 2527 | 7.7% |
| windward | 2377 | 7.2% |
| bahama | 1504 | 4.6% |
| leeward | 1357 | 4.1% |
| sunda | 1019 | 3.1% |
| greater | 1018 | 3.1% |
| northern | 671 | 2.0% |
| solomon | 655 | 2.0% |
| Other values (48) | 2663 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | 7.8% |
| d | 17902 | 6.8% |
| (space) | 13066 | 4.9% |
| o | 12195 | 4.6% |
| r | 10747 | 4.1% |
| I | 10283 | 3.9% |
| Other values (35) | 54650 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 218943 | |
| Uppercase Letter | 32978 | 12.4% |
| Space Separator | 13066 | 4.9% |
| Open Punctuation | 8 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
| Other Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | |
| d | 17902 | |
| o | 12195 | 5.6% |
| r | 10747 | 4.9% |
| p | 9142 | 4.2% |
| e | 5828 | 2.7% |
| Other values (13) | 16956 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 10283 | |
| H | 8927 | |
| V | 2533 | 7.7% |
| W | 2377 | 7.2% |
| S | 1736 | 5.3% |
| B | 1647 | 5.0% |
| L | 1372 | 4.2% |
| G | 1061 | 3.2% |
| C | 934 | 2.8% |
| N | 775 | 2.4% |
| Other values (7) | 1333 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13066 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 8 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 251921 | |
| Common | 13095 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | |
| d | 17902 | 7.1% |
| o | 12195 | 4.8% |
| r | 10747 | 4.3% |
| I | 10283 | 4.1% |
| p | 9142 | 3.6% |
| Other values (30) | 45479 |
Common
| Value | Count | Frequency (%) |
| (space) | 13066 | |
| ( | 8 | 0.1% |
| = | 8 | 0.1% |
| ) | 8 | 0.1% |
| . | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 265016 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | 7.8% |
| d | 17902 | 6.8% |
| (space) | 13066 | 4.9% |
| o | 12195 | 4.6% |
| r | 10747 | 4.1% |
| I | 10283 | 3.9% |
| Other values (35) | 54650 |
island
Text
Missing 
| Distinct | 39 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 576136 |
| Missing (%) | 98.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 10 |
| Mean length | 10.77445753 |
| Min length | 6 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | New Guinea |
|---|---|
| 2nd row | Grenada Island |
| 3rd row | New Guinea |
| 4th row | New Guinea |
| 5th row | Little Swan Island |
| Value | Count | Frequency (%) |
| new | 4350 | |
| guinea | 4350 | |
| island | 1306 | 8.7% |
| borneo | 712 | 4.7% |
| bougainville | 652 | 4.3% |
| sumatra | 558 | 3.7% |
| okinawa | 493 | 3.3% |
| grenada | 267 | 1.8% |
| isla | 258 | 1.7% |
| swan | 241 | 1.6% |
| Other values (44) | 1803 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| (space) | 6925 | 8.0% |
| i | 6731 | 7.7% |
| u | 5716 | 6.6% |
| w | 5086 | 5.9% |
| G | 4959 | 5.7% |
| N | 4459 | 5.1% |
| l | 3060 | 3.5% |
| Other values (34) | 19270 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65206 | |
| Uppercase Letter | 14765 | 17.0% |
| Space Separator | 6925 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| i | 6731 | |
| u | 5716 | |
| w | 5086 | |
| l | 3060 | 4.7% |
| o | 2768 | 4.2% |
| d | 2350 | 3.6% |
| s | 2071 | 3.2% |
| Other values (14) | 6734 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 4959 | |
| N | 4459 | |
| I | 1683 | 11.4% |
| B | 1407 | 9.5% |
| S | 841 | 5.7% |
| O | 512 | 3.5% |
| U | 199 | 1.3% |
| K | 190 | 1.3% |
| L | 178 | 1.2% |
| R | 151 | 1.0% |
| Other values (9) | 186 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6925 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79971 | |
| Common | 6925 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| i | 6731 | |
| u | 5716 | 7.1% |
| w | 5086 | 6.4% |
| G | 4959 | 6.2% |
| N | 4459 | 5.6% |
| l | 3060 | 3.8% |
| o | 2768 | 3.5% |
| Other values (33) | 16502 |
Common
| Value | Count | Frequency (%) |
| (space) | 6925 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86748 | |
| None | 148 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| (space) | 6925 | 8.0% |
| i | 6731 | 7.8% |
| u | 5716 | 6.6% |
| w | 5086 | 5.9% |
| G | 4959 | 5.7% |
| N | 4459 | 5.1% |
| l | 3060 | 3.5% |
| Other values (33) | 19122 |
None
| Value | Count | Frequency (%) |
| á | 148 |
countryCode
Text
Missing 
| Distinct | 198 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10837 |
| Missing (%) | 1.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PG |
|---|---|
| 2nd row | US |
| 3rd row | TO |
| 4th row | GD |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 334216 | |
| mx | 22787 | 4.0% |
| ec | 16235 | 2.8% |
| br | 14722 | 2.6% |
| pe | 12875 | 2.2% |
| ph | 11392 | 2.0% |
| hn | 10938 | 1.9% |
| pa | 7718 | 1.3% |
| jm | 7293 | 1.3% |
| gu | 5665 | 1.0% |
| Other values (188) | 129523 | 22.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 348407 | |
| S | 342999 | |
| P | 50905 | 4.4% |
| M | 48558 | 4.2% |
| C | 41150 | 3.6% |
| E | 38991 | 3.4% |
| H | 32407 | 2.8% |
| G | 25767 | 2.2% |
| R | 24403 | 2.1% |
| X | 22787 | 2.0% |
| Other values (16) | 170354 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1146728 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 348407 | |
| S | 342999 | |
| P | 50905 | 4.4% |
| M | 48558 | 4.2% |
| C | 41150 | 3.6% |
| E | 38991 | 3.4% |
| H | 32407 | 2.8% |
| G | 25767 | 2.2% |
| R | 24403 | 2.1% |
| X | 22787 | 2.0% |
| Other values (16) | 170354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1146728 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 348407 | |
| S | 342999 | |
| P | 50905 | 4.4% |
| M | 48558 | 4.2% |
| C | 41150 | 3.6% |
| E | 38991 | 3.4% |
| H | 32407 | 2.8% |
| G | 25767 | 2.2% |
| R | 24403 | 2.1% |
| X | 22787 | 2.0% |
| Other values (16) | 170354 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1146728 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 348407 | |
| S | 342999 | |
| P | 50905 | 4.4% |
| M | 48558 | 4.2% |
| C | 41150 | 3.6% |
| E | 38991 | 3.4% |
| H | 32407 | 2.8% |
| G | 25767 | 2.2% |
| R | 24403 | 2.1% |
| X | 22787 | 2.0% |
| Other values (16) | 170354 |
stateProvince
Text
Missing 
| Distinct | 2059 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 17001 |
| Missing (%) | 2.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 52 |
| Mean length | 10.58665021 |
| Min length | 3 |
Unique
| Unique | 356 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Central Province |
|---|---|
| 2nd row | North Carolina |
| 3rd row | Tonga Islands |
| 4th row | St. George Parish |
| 5th row | Virginia |
| Value | Count | Frequency (%) |
| virginia | 93314 | 11.0% |
| carolina | 61709 | 7.2% |
| north | 57614 | 6.8% |
| maryland | 32649 | 3.8% |
| province | 27443 | 3.2% |
| pennsylvania | 18911 | 2.2% |
| west | 18140 | 2.1% |
| florida | 18100 | 2.1% |
| island | 18015 | 2.1% |
| tennessee | 17444 | 2.0% |
| Other values (1937) | 487863 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | 10.5% |
| n | 557794 | 9.3% |
| r | 474453 | 7.9% |
| o | 407390 | 6.8% |
| e | 304504 | 5.1% |
| (space) | 284002 | 4.7% |
| l | 264922 | 4.4% |
| s | 256173 | 4.3% |
| t | 191100 | 3.2% |
| Other values (62) | 1805903 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4862616 | |
| Uppercase Letter | 830502 | 13.8% |
| Space Separator | 284002 | 4.7% |
| Dash Punctuation | 16262 | 0.3% |
| Other Punctuation | 9979 | 0.2% |
| Open Punctuation | 537 | < 0.1% |
| Close Punctuation | 532 | < 0.1% |
| Math Symbol | 318 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | |
| n | 557794 | |
| r | 474453 | |
| o | 407390 | |
| e | 304504 | 6.3% |
| l | 264922 | 5.4% |
| s | 256173 | 5.3% |
| t | 191100 | 3.9% |
| g | 142256 | 2.9% |
| Other values (24) | 805517 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 108499 | |
| V | 99187 | |
| P | 90787 | |
| N | 84504 | |
| M | 71209 | |
| I | 44333 | 5.3% |
| S | 44167 | 5.3% |
| T | 42704 | 5.1% |
| G | 35681 | 4.3% |
| A | 34622 | 4.2% |
| Other values (17) | 174809 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9193 | |
| ' | 757 | 7.6% |
| ? | 19 | 0.2% |
| / | 6 | 0.1% |
| , | 4 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 298 | |
| + | 20 | 6.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 284002 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16262 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 537 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 532 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5693118 | |
| Common | 311630 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | 11.1% |
| n | 557794 | 9.8% |
| r | 474453 | 8.3% |
| o | 407390 | 7.2% |
| e | 304504 | 5.3% |
| l | 264922 | 4.7% |
| s | 256173 | 4.5% |
| t | 191100 | 3.4% |
| g | 142256 | 2.5% |
| Other values (51) | 1636019 |
Common
| Value | Count | Frequency (%) |
| (space) | 284002 | |
| - | 16262 | 5.2% |
| . | 9193 | 2.9% |
| ' | 757 | 0.2% |
| ( | 537 | 0.2% |
| ) | 532 | 0.2% |
| = | 298 | 0.1% |
| + | 20 | < 0.1% |
| ? | 19 | < 0.1% |
| / | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5984875 | |
| None | 19873 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | 10.6% |
| n | 557794 | 9.3% |
| r | 474453 | 7.9% |
| o | 407390 | 6.8% |
| e | 304504 | 5.1% |
| (space) | 284002 | 4.7% |
| l | 264922 | 4.4% |
| s | 256173 | 4.3% |
| t | 191100 | 3.2% |
| Other values (53) | 1786030 |
None
| Value | Count | Frequency (%) |
| á | 4907 | |
| é | 4585 | |
| ã | 3690 | |
| ó | 2908 | |
| í | 2325 | |
| ô | 1036 | 5.2% |
| ñ | 367 | 1.8% |
| ı | 48 | 0.2% |
| Î | 7 | < 0.1% |
county
Text
Missing 
| Distinct | 3056 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 191557 |
| Missing (%) | 32.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 43 |
| Mean length | 9.394395432 |
| Min length | 3 |
Unique
| Unique | 504 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Kairuku-Hiri District |
|---|---|
| 2nd row | Buncombe - Yancey |
| 3rd row | Tongatapu Island Group |
| 4th row | Augusta |
| 5th row | Elko |
| Value | Count | Frequency (%) |
| (blank) | 21119 | 3.8% |
| island | 14180 | 2.6% |
| swain | 12742 | 2.3% |
| city | 8568 | 1.6% |
| province | 8458 | 1.5% |
| giles | 8024 | 1.5% |
| frederick | 7508 | 1.4% |
| macon | 7377 | 1.3% |
| municipality | 7367 | 1.3% |
| haywood | 7297 | 1.3% |
| Other values (2826) | 448585 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 361375 | 9.8% |
| e | 318401 | 8.6% |
| n | 281913 | 7.6% |
| o | 250126 | 6.8% |
| i | 237836 | 6.4% |
| r | 221961 | 6.0% |
| l | 181195 | 4.9% |
| (space) | 158581 | 4.3% |
| s | 154891 | 4.2% |
| t | 142082 | 3.9% |
| Other values (64) | 1380292 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2956891 | |
| Uppercase Letter | 526246 | 14.3% |
| Space Separator | 158581 | 4.3% |
| Dash Punctuation | 25865 | 0.7% |
| Close Punctuation | 7839 | 0.2% |
| Open Punctuation | 7839 | 0.2% |
| Other Punctuation | 5243 | 0.1% |
| Math Symbol | 83 | < 0.1% |
| Decimal Number | 64 | < 0.1% |
| Modifier Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 361375 | |
| e | 318401 | |
| n | 281913 | |
| o | 250126 | 8.5% |
| i | 237836 | 8.0% |
| r | 221961 | 7.5% |
| l | 181195 | 6.1% |
| s | 154891 | 5.2% |
| t | 142082 | 4.8% |
| c | 111847 | 3.8% |
| Other values (25) | 695264 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 56403 | 10.7% |
| S | 49852 | 9.5% |
| C | 48154 | 9.2% |
| P | 46114 | 8.8% |
| G | 36649 | 7.0% |
| B | 29767 | 5.7% |
| I | 27305 | 5.2% |
| A | 26724 | 5.1% |
| H | 24796 | 4.7% |
| R | 21003 | 4.0% |
| Other values (15) | 159479 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3451 | |
| ' | 1471 | |
| , | 294 | 5.6% |
| ? | 22 | 0.4% |
| / | 5 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25626 | |
| – | 239 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 32 | |
| 0 | 32 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 158581 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7839 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7839 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 83 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3483137 | |
| Common | 205516 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 361375 | 10.4% |
| e | 318401 | 9.1% |
| n | 281913 | 8.1% |
| o | 250126 | 7.2% |
| i | 237836 | 6.8% |
| r | 221961 | 6.4% |
| l | 181195 | 5.2% |
| s | 154891 | 4.4% |
| t | 142082 | 4.1% |
| c | 111847 | 3.2% |
| Other values (50) | 1221510 |
Common
| Value | Count | Frequency (%) |
| (space) | 158581 | |
| - | 25626 | 12.5% |
| ) | 7839 | 3.8% |
| ( | 7839 | 3.8% |
| . | 3451 | 1.7% |
| ' | 1471 | 0.7% |
| , | 294 | 0.1% |
| – | 239 | 0.1% |
| = | 83 | < 0.1% |
| 1 | 32 | < 0.1% |
| Other values (4) | 61 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3684587 | |
| None | 3825 | 0.1% |
| Punctuation | 239 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 361375 | 9.8% |
| e | 318401 | 8.6% |
| n | 281913 | 7.7% |
| o | 250126 | 6.8% |
| i | 237836 | 6.5% |
| r | 221961 | 6.0% |
| l | 181195 | 4.9% |
| (space) | 158581 | 4.3% |
| s | 154891 | 4.2% |
| t | 142082 | 3.9% |
| Other values (53) | 1376226 |
None
| Value | Count | Frequency (%) |
| é | 1444 | |
| í | 911 | |
| á | 870 | |
| ó | 315 | 8.2% |
| ô | 96 | 2.5% |
| ñ | 72 | 1.9% |
| â | 51 | 1.3% |
| ü | 38 | 1.0% |
| è | 28 | 0.7% |
Punctuation
| Value | Count | Frequency (%) |
| – | 239 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 2 |
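The summary percentages for county can be cross-checked against the raw counts in the tables above. The sketch below is a hedged reconstruction, not the profiler's own code: it assumes that "Distinct (%)" and "Mean length" use the non-missing rows as the denominator, and that the four block counts (ASCII, None, Punctuation, Modifier Letters) add up to the total character count. The variable names are illustrative only.

```python
# Figures copied from the county tables and the dataset overview of this report.
n_rows = 584_201        # Number of observations
n_missing = 191_557     # Missing
n_distinct = 3_056      # Distinct

# Total characters: ASCII + None + Punctuation + Modifier Letters blocks.
total_chars = 3_684_587 + 3_825 + 239 + 2

# Assumption: distinct share and mean length are taken over non-missing rows.
n_present = n_rows - n_missing

missing_pct = 100 * n_missing / n_rows
distinct_pct = 100 * n_distinct / n_present
mean_length = total_chars / n_present

print(f"Missing (%):  {missing_pct:.1f}%")
print(f"Distinct (%): {distinct_pct:.1f}%")
print(f"Mean length:  {mean_length:.9f}")
```

Under these assumptions the three values agree with the reported 32.8%, 0.8% and 9.394395432, which suggests the denominators were guessed correctly for this variable.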
locality
Text
| Distinct | 56242 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 2303 |
| Missing (%) | 0.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 295 |
|---|---|
| Median length | 193 |
| Mean length | 54.40064066 |
| Min length | 2 |
Unique
| Unique | 24789 |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | Kairuku, Yule Island |
|---|---|
| 2nd row | Pisgah National Forest, near Cane River Gap |
| 3rd row | No Locality Data |
| 4th row | Tongatapu Island, adjacent to Fua'amotu Airport |
| 5th row | Grand Anse Bay, west end of, along road to jetty just east of base of Quarantine Point |
| Value | Count | Frequency (%) |
| of | 456712 | 8.0% |
| mi | 190409 | 3.3% |
| road | 182915 | 3.2% |
| route | 156226 | 2.7% |
| on | 147202 | 2.6% |
| national | 106083 | 1.8% |
| by | 93415 | 1.6% |
| forest | 89661 | 1.6% |
| junction | 81776 | 1.4% |
| km | 68711 | 1.2% |
| Other values (30771) | 4165761 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 5156973 | |
| a | 2389818 | 7.5% |
| o | 2384544 | 7.5% |
| e | 1748111 | 5.5% |
| n | 1666496 | 5.3% |
| i | 1568945 | 5.0% |
| t | 1523016 | 4.8% |
| r | 1291111 | 4.1% |
| l | 964589 | 3.0% |
| , | 845140 | 2.7% |
| Other values (100) | 12116881 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19646027 | |
| Space Separator | 5156973 | 16.3% |
| Uppercase Letter | 3980245 | 12.6% |
| Other Punctuation | 1240931 | 3.9% |
| Decimal Number | 1169470 | 3.7% |
| Open Punctuation | 200092 | 0.6% |
| Close Punctuation | 200069 | 0.6% |
| Dash Punctuation | 36149 | 0.1% |
| Math Symbol | 25534 | 0.1% |
| Format | 126 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2389818 | |
| o | 2384544 | |
| e | 1748111 | 8.9% |
| n | 1666496 | 8.5% |
| i | 1568945 | 8.0% |
| t | 1523016 | 7.8% |
| r | 1291111 | 6.6% |
| l | 964589 | 4.9% |
| u | 736911 | 3.8% |
| s | 725961 | 3.7% |
| Other values (38) | 4646525 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 444979 | 11.2% |
| S | 427726 | 10.7% |
| N | 377809 | 9.5% |
| C | 256106 | 6.4% |
| M | 232345 | 5.8% |
| E | 225035 | 5.7% |
| W | 219583 | 5.5% |
| P | 208932 | 5.2% |
| F | 190052 | 4.8% |
| A | 187582 | 4.7% |
| Other values (18) | 1210096 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 845140 | |
| . | 367701 | |
| ' | 12150 | 1.0% |
| ; | 6267 | 0.5% |
| / | 6172 | 0.5% |
| " | 1760 | 0.1% |
| : | 693 | 0.1% |
| ? | 661 | 0.1% |
| # | 351 | < 0.1% |
| & | 36 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 222198 | |
| 0 | 175576 | |
| 2 | 157851 | |
| 5 | 124586 | |
| 3 | 116675 | |
| 6 | 105408 | |
| 4 | 93349 | |
| 7 | 69091 | 5.9% |
| 8 | 55301 | 4.7% |
| 9 | 49435 | 4.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 200029 | |
| [ | 62 | < 0.1% |
| ‚ | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 24365 | |
| + | 1165 | 4.6% |
| < | 4 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 200007 | |
| ] | 62 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36140 | |
| – | 9 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5156973 |
Format
| Value | Count | Frequency (%) |
| | 126 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23626272 | |
| Common | 8029352 | 25.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2389818 | 10.1% |
| o | 2384544 | 10.1% |
| e | 1748111 | 7.4% |
| n | 1666496 | 7.1% |
| i | 1568945 | 6.6% |
| t | 1523016 | 6.4% |
| r | 1291111 | 5.5% |
| l | 964589 | 4.1% |
| u | 736911 | 3.1% |
| s | 725961 | 3.1% |
| Other values (66) | 8626770 |
Common
| Value | Count | Frequency (%) |
| (space) | 5156973 | |
| , | 845140 | 10.5% |
| . | 367701 | 4.6% |
| 1 | 222198 | 2.8% |
| ( | 200029 | 2.5% |
| ) | 200007 | 2.5% |
| 0 | 175576 | 2.2% |
| 2 | 157851 | 2.0% |
| 5 | 124586 | 1.6% |
| 3 | 116675 | 1.5% |
| Other values (24) | 462616 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31626455 | |
| None | 29146 | 0.1% |
| Latin Ext Additional | 13 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 5156973 | |
| a | 2389818 | 7.6% |
| o | 2384544 | 7.5% |
| e | 1748111 | 5.5% |
| n | 1666496 | 5.3% |
| i | 1568945 | 5.0% |
| t | 1523016 | 4.8% |
| r | 1291111 | 4.1% |
| l | 964589 | 3.0% |
| , | 845140 | 2.7% |
| Other values (73) | 12087712 |
None
| Value | Count | Frequency (%) |
| í | 24109 | |
| é | 1678 | 5.8% |
| á | 1098 | 3.8% |
| ñ | 788 | 2.7% |
| â | 452 | 1.6% |
| ó | 240 | 0.8% |
| ú | 196 | 0.7% |
| ô | 169 | 0.6% |
| | 126 | 0.4% |
| è | 59 | 0.2% |
| Other values (12) | 231 | 0.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ấ | 9 | |
| ạ | 2 | 15.4% |
| ể | 2 | 15.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 9 | |
| ‚ | 1 | 10.0% |
Missing 
| Distinct | 2882 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 331608 |
| Missing (%) | 56.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 93 |
|---|---|
| Median length | 46 |
| Mean length | 7.093015246 |
| Min length | 3 |
Unique
| Unique | 530 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 4320 ft |
|---|---|
| 2nd row | 4351 ft |
| 3rd row | 2200 m |
| 4th row | 30-50 m |
| 5th row | 30 ft |
| Value | Count | Frequency (%) |
| ft | 191831 | |
| m | 59860 | 11.5% |
| ca | 13358 | 2.6% |
| 1100-1350 | 4058 | 0.8% |
| 200 | 3781 | 0.7% |
| 10 | 3450 | 0.7% |
| 3400 | 2848 | 0.5% |
| 3500 | 2819 | 0.5% |
| 20 | 2706 | 0.5% |
| 3600 | 2513 | 0.5% |
| Other values (2009) | 234300 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| (space) | 268931 | |
| t | 192412 | |
| f | 192004 | |
| 1 | 99566 | 5.6% |
| 3 | 96808 | 5.4% |
| 2 | 90988 | 5.1% |
| 4 | 83319 | 4.7% |
| 5 | 76675 | 4.3% |
| m | 59946 | 3.3% |
| Other values (47) | 254724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 994929 | |
| Lowercase Letter | 481690 | |
| Space Separator | 268931 | 15.0% |
| Dash Punctuation | 30052 | 1.7% |
| Other Punctuation | 13757 | 0.8% |
| Close Punctuation | 1006 | 0.1% |
| Open Punctuation | 1006 | 0.1% |
| Math Symbol | 195 | < 0.1% |
| Uppercase Letter | 80 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 192412 | |
| f | 192004 | |
| m | 59946 | 12.4% |
| a | 14859 | 3.1% |
| c | 13366 | 2.8% |
| e | 3277 | 0.7% |
| l | 1590 | 0.3% |
| v | 1058 | 0.2% |
| s | 835 | 0.2% |
| o | 611 | 0.1% |
| Other values (15) | 1732 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| 1 | 99566 | 10.0% |
| 3 | 96808 | 9.7% |
| 2 | 90988 | 9.1% |
| 4 | 83319 | 8.4% |
| 5 | 76675 | 7.7% |
| 6 | 59540 | 6.0% |
| 8 | 45372 | 4.6% |
| 7 | 38030 | 3.8% |
| 9 | 28358 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 23 | |
| S | 15 | |
| P | 12 | |
| G | 12 | |
| A | 10 | |
| D | 5 | 6.2% |
| L | 2 | 2.5% |
| M | 1 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13576 | |
| , | 90 | 0.7% |
| / | 39 | 0.3% |
| ; | 22 | 0.2% |
| ? | 22 | 0.2% |
| ' | 6 | < 0.1% |
| ‡ | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 110 | |
| + | 75 | |
| = | 10 | 5.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 268931 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30052 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1006 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1006 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1309876 | |
| Latin | 481770 | 26.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 192412 | |
| f | 192004 | |
| m | 59946 | 12.4% |
| a | 14859 | 3.1% |
| c | 13366 | 2.8% |
| e | 3277 | 0.7% |
| l | 1590 | 0.3% |
| v | 1058 | 0.2% |
| s | 835 | 0.2% |
| o | 611 | 0.1% |
| Other values (23) | 1812 | 0.4% |
Common
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| (space) | 268931 | |
| 1 | 99566 | 7.6% |
| 3 | 96808 | 7.4% |
| 2 | 90988 | 6.9% |
| 4 | 83319 | 6.4% |
| 5 | 76675 | 5.9% |
| 6 | 59540 | 4.5% |
| 8 | 45372 | 3.5% |
| 7 | 38030 | 2.9% |
| Other values (14) | 74374 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1791644 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| (space) | 268931 | |
| t | 192412 | |
| f | 192004 | |
| 1 | 99566 | 5.6% |
| 3 | 96808 | 5.4% |
| 2 | 90988 | 5.1% |
| 4 | 83319 | 4.7% |
| 5 | 76675 | 4.3% |
| m | 59946 | 3.3% |
| Other values (46) | 254722 |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 2 |
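The values profiled in the tables above are free text that mixes units ("4320 ft", "2200 m"), ranges ("30-50 m") and qualifiers such as "ca". A minimal sketch of how such strings might be normalised to metres follows. The function name, the regular expression and the decision to return None for anything unrecognised are all assumptions made for illustration; the report itself does not describe any parsing step.

```python
import re

FT_TO_M = 0.3048  # exact definition of the international foot

# Hypothetical pattern: optional "ca"/"ca.", a number, an optional "-upper"
# bound, then a unit of "ft" or "m". Anything else is left unparsed.
_PATTERN = re.compile(
    r"^\s*(?:ca\.?\s*)?(\d+(?:\.\d+)?)(?:\s*-\s*(\d+(?:\.\d+)?))?\s*(ft|m)\s*$",
    re.IGNORECASE,
)

def parse_range_to_metres(text):
    """Return (low, high) in metres, or None if the text does not match."""
    match = _PATTERN.match(text)
    if match is None:
        return None
    low = float(match.group(1))
    high = float(match.group(2)) if match.group(2) else low
    factor = FT_TO_M if match.group(3).lower() == "ft" else 1.0
    return (low * factor, high * factor)

# The sample rows shown in this report.
for raw in ["4320 ft", "4351 ft", "2200 m", "30-50 m", "30 ft"]:
    print(raw, "->", parse_range_to_metres(raw))
```

Values containing "<", "+", "?" or other annotations, which the character tables show are present in small numbers, deliberately fall through to None so they can be reviewed by hand rather than silently coerced.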
decimalLatitude
Text
Missing 
| Distinct | 24490 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 162667 |
| Missing (%) | 27.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 6.90323675 |
| Min length | 3 |
Unique
| Unique | 8226 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | -8.8201 |
|---|---|
| 2nd row | 35.8083 |
| 3rd row | 12.0217 |
| 4th row | 38.39 |
| 5th row | 40.9580375 |
| Value | Count | Frequency (%) |
| 39.6306 | 4296 | 1.0% |
| 13.6389 | 2247 | 0.5% |
| 39.8872 | 1888 | 0.4% |
| 12.83 | 1754 | 0.4% |
| 26.9844 | 1718 | 0.4% |
| 4.0147 | 1664 | 0.4% |
| 37.4161 | 1535 | 0.4% |
| 36.7631 | 1511 | 0.4% |
| 25.4017 | 1483 | 0.4% |
| 36.9486 | 1468 | 0.3% |
| Other values (24041) | 401970 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 480691 | |
| . | 421534 | |
| 1 | 245474 | |
| 6 | 236090 | |
| 8 | 232386 | |
| 4 | 230710 | |
| 5 | 230378 | |
| 7 | 210588 | |
| 2 | 210386 | |
| 9 | 198067 | |
| Other values (3) | 213645 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2429298 | |
| Other Punctuation | 421534 | 14.5% |
| Dash Punctuation | 59054 | 2.0% |
| Uppercase Letter | 63 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 480691 | |
| 1 | 245474 | |
| 6 | 236090 | |
| 8 | 232386 | |
| 4 | 230710 | |
| 5 | 230378 | |
| 7 | 210588 | |
| 2 | 210386 | |
| 9 | 198067 | |
| 0 | 154528 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 421534 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 59054 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 63 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2909886 | |
| Latin | 63 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 480691 | |
| . | 421534 | |
| 1 | 245474 | |
| 6 | 236090 | |
| 8 | 232386 | |
| 4 | 230710 | |
| 5 | 230378 | |
| 7 | 210588 | |
| 2 | 210386 | |
| 9 | 198067 | |
| Other values (2) | 213582 |
Latin
| Value | Count | Frequency (%) |
| E | 63 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2909949 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 480691 | |
| . | 421534 | |
| 1 | 245474 | |
| 6 | 236090 | |
| 8 | 232386 | |
| 4 | 230710 | |
| 5 | 230378 | |
| 7 | 210588 | |
| 2 | 210386 | |
| 9 | 198067 | |
| Other values (3) | 213645 |
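decimalLatitude is profiled as Text, and its character tables list 63 occurrences of an uppercase "E" alongside digits, "." and "-". One possible explanation, which the report does not confirm, is that a few values are written in scientific notation. The sketch below shows one way to convert such strings to floats while rejecting anything outside the valid latitude range; the function name and the example value "4.0E-4" are hypothetical.

```python
import math

def parse_latitude(text):
    """Convert a latitude string to float; return None if invalid or out of range."""
    try:
        value = float(text)  # Python's float() accepts forms like "4.0E-4"
    except (TypeError, ValueError):
        return None
    if not math.isfinite(value) or not -90.0 <= value <= 90.0:
        return None
    return value

# Two sample rows from this report plus one made-up exponent-form value.
for raw in ["-8.8201", "40.9580375", "4.0E-4"]:
    print(raw, "->", parse_latitude(raw))
```

The same approach would apply to decimalLongitude with bounds of -180 to 180.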
decimalLongitude
Text
Missing 
| Distinct | 24797 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 162667 |
| Missing (%) | 27.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 7.814814463 |
| Min length | 3 |
Unique
| Unique | 8175 |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | 146.53 |
|---|---|
| 2nd row | -82.3481 |
| 3rd row | -61.7664 |
| 4th row | -79.25 |
| 5th row | -115.4346518 |
| Value | Count | Frequency (%) |
| 77.4714 | 4296 | 1.0% |
| 144.962 | 2247 | 0.5% |
| 77.7786 | 2139 | 0.5% |
| 87.1889 | 1888 | 0.4% |
| 69.28 | 1763 | 0.4% |
| 81.4919 | 1718 | 0.4% |
| 80.5097 | 1653 | 0.4% |
| 81.2228 | 1509 | 0.4% |
| 80.6567 | 1483 | 0.4% |
| 79.5561 | 1463 | 0.3% |
| Other values (24682) | 401375 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 421534 | |
| 7 | 386519 | |
| - | 382134 | |
| 8 | 368334 | |
| 1 | 264538 | |
| 3 | 247782 | |
| 6 | 236340 | |
| 9 | 222019 | |
| 4 | 208966 | |
| 5 | 204225 | |
| Other values (2) | 351819 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2490542 | |
| Other Punctuation | 421534 | 12.8% |
| Dash Punctuation | 382134 | 11.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 386519 | |
| 8 | 368334 | |
| 1 | 264538 | |
| 3 | 247782 | |
| 6 | 236340 | |
| 9 | 222019 | |
| 4 | 208966 | |
| 5 | 204225 | |
| 2 | 203737 | |
| 0 | 148082 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 421534 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 382134 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3294210 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 421534 | |
| 7 | 386519 | |
| - | 382134 | |
| 8 | 368334 | |
| 1 | 264538 | |
| 3 | 247782 | |
| 6 | 236340 | |
| 9 | 222019 | |
| 4 | 208966 | |
| 5 | 204225 | |
| Other values (2) | 351819 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3294210 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 421534 | |
| 7 | 386519 | |
| - | 382134 | |
| 8 | 368334 | |
| 1 | 264538 | |
| 3 | 247782 | |
| 6 | 236340 | |
| 9 | 222019 | |
| 4 | 208966 | |
| 5 | 204225 | |
| Other values (2) | 351819 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 7350 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 439218 |
| Missing (%) | 75.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.368484581 |
| Min length | 3 |
Unique
| Unique | 2148 |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | 402.34 |
|---|---|
| 2nd row | 96.56 |
| 3rd row | 152901.0 |
| 4th row | 6115.0 |
| 5th row | 1754.18 |
| Value | Count | Frequency (%) |
| 347.62 | 1384 | 1.0% |
| 186.68 | 1338 | 0.9% |
| 4615.0 | 1110 | 0.8% |
| 5615.0 | 1066 | 0.7% |
| 1066.0 | 1030 | 0.7% |
| 3615.0 | 978 | 0.7% |
| 5115.0 | 953 | 0.7% |
| 4115.0 | 946 | 0.7% |
| 177.03 | 882 | 0.6% |
| 402.34 | 826 | 0.6% |
| Other values (7340) | 134470 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 144983 | |
| 0 | 110601 | |
| 1 | 109273 | |
| 2 | 82741 | |
| 5 | 79271 | |
| 3 | 75055 | |
| 4 | 74981 | |
| 6 | 67932 | |
| 9 | 62051 | |
| 8 | 58564 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 778339 | |
| Other Punctuation | 144983 | 15.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 110601 | |
| 1 | 109273 | |
| 2 | 82741 | |
| 5 | 79271 | |
| 3 | 75055 | |
| 4 | 74981 | |
| 6 | 67932 | |
| 9 | 62051 | |
| 8 | 58564 | |
| 7 | 57870 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 144983 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 923322 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 144983 | |
| 0 | 110601 | |
| 1 | 109273 | |
| 2 | 82741 | |
| 5 | 79271 | |
| 3 | 75055 | |
| 4 | 74981 | |
| 6 | 67932 | |
| 9 | 62051 | |
| 8 | 58564 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 923322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 144983 | |
| 0 | 110601 | |
| 1 | 109273 | |
| 2 | 82741 | |
| 5 | 79271 | |
| 3 | 75055 | |
| 4 | 74981 | |
| 6 | 67932 | |
| 9 | 62051 | |
| 8 | 58564 |
Missing 
| Distinct | 3371 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 439136 |
| Missing (%) | 75.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 302 |
|---|---|
| Median length | 251 |
| Mean length | 91.26128977 |
| Min length | 3 |
Unique
| Unique | 891 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | USGS Palo Alto Quad (TopoZone - 1:24,000), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
|---|---|
| 2nd row | Terrain Navigator v. 5.03 USGS 1:24,000, MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 3rd row | Alexandria Digital Library Gazetteer, MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 4th row | USGS Chesterfield Quad (TopoZine - 1:24,000), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 5th row | USGS Falls Church Quad (TopoZone - 1:24,000), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| Value | Count | Frequency (%) |
| georeferencing | 134216 | 9.7% |
| manis/herpnet/ornis | 134163 | 9.7% |
| guidelines | 134143 | 9.7% |
| usgs | 59079 | 4.3% |
| 1:24,000 | 54333 | 3.9% |
| 44136 | 3.2% | |
| quad | 39827 | 2.9% |
| digital | 22588 | 1.6% |
| gazetteer | 22105 | 1.6% |
| topozone | 21638 | 1.6% |
| Other values (3792) | 715459 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1320173 | 10.0% |
| (space) | 1236622 | 9.3% |
| r | 733799 | 5.5% |
| i | 691510 | 5.2% |
| a | 629206 | 4.8% |
| n | 622138 | 4.7% |
| o | 500801 | 3.8% |
| N | 461182 | 3.5% |
| S | 454207 | 3.4% |
| G | 414644 | 3.1% |
| Other values (76) | 6174537 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7136568 | |
| Uppercase Letter | 3060694 | |
| Space Separator | 1236622 | 9.3% |
| Decimal Number | 835786 | 6.3% |
| Other Punctuation | 760937 | 5.7% |
| Open Punctuation | 71491 | 0.5% |
| Close Punctuation | 71272 | 0.5% |
| Dash Punctuation | 65161 | 0.5% |
| Connector Punctuation | 248 | < 0.1% |
| Math Symbol | 40 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1320173 | |
| r | 733799 | |
| i | 691510 | |
| a | 629206 | |
| n | 622138 | |
| o | 500801 | 7.0% |
| l | 307980 | 4.3% |
| d | 294924 | 4.1% |
| t | 258449 | 3.6% |
| g | 250955 | 3.5% |
| Other values (19) | 1526633 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 461182 | |
| S | 454207 | |
| G | 414644 | |
| I | 303621 | |
| T | 221610 | |
| M | 189899 | |
| E | 166244 | 5.4% |
| O | 161796 | 5.3% |
| R | 151192 | 4.9% |
| H | 140237 | 4.6% |
| Other values (17) | 396062 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 286534 | |
| , | 258402 | |
| : | 100996 | 13.3% |
| . | 80708 | 10.6% |
| ; | 15057 | 2.0% |
| ! | 9034 | 1.2% |
| # | 6647 | 0.9% |
| ' | 2637 | 0.3% |
| & | 813 | 0.1% |
| ? | 94 | < 0.1% |
| Other values (3) | 15 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 379892 | |
| 1 | 133690 | 16.0% |
| 2 | 100269 | 12.0% |
| 4 | 76915 | 9.2% |
| 5 | 38693 | 4.6% |
| 7 | 25544 | 3.1% |
| 9 | 22590 | 2.7% |
| 6 | 22338 | 2.7% |
| 3 | 22202 | 2.7% |
| 8 | 13653 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 24 | |
| = | 16 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1236622 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 71491 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 71272 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 65161 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 248 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10197262 | |
| Common | 3041557 | 23.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1320173 | 12.9% |
| r | 733799 | 7.2% |
| i | 691510 | 6.8% |
| a | 629206 | 6.2% |
| n | 622138 | 6.1% |
| o | 500801 | 4.9% |
| N | 461182 | 4.5% |
| S | 454207 | 4.5% |
| G | 414644 | 4.1% |
| l | 307980 | 3.0% |
| Other values (46) | 4061622 |
Common
| Value | Count | Frequency (%) |
| (space) | 1236622 | |
| 0 | 379892 | 12.5% |
| / | 286534 | 9.4% |
| , | 258402 | 8.5% |
| 1 | 133690 | 4.4% |
| : | 100996 | 3.3% |
| 2 | 100269 | 3.3% |
| . | 80708 | 2.7% |
| 4 | 76915 | 2.5% |
| ( | 71491 | 2.4% |
| Other values (20) | 316038 | 10.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13234776 | |
| None | 4039 | < 0.1% |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1320173 | 10.0% |
| (space) | 1236622 | 9.3% |
| r | 733799 | 5.5% |
| i | 691510 | 5.2% |
| a | 629206 | 4.8% |
| n | 622138 | 4.7% |
| o | 500801 | 3.8% |
| N | 461182 | 3.5% |
| S | 454207 | 3.4% |
| G | 414644 | 3.1% |
| Other values (71) | 6170494 |
None
| Value | Count | Frequency (%) |
| í | 4030 | |
| é | 5 | 0.1% |
| ô | 2 | < 0.1% |
| Î | 2 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 4 |
Missing 
| Distinct | 3681 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 443625 |
| Missing (%) | 75.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 83 |
|---|---|
| Median length | 55 |
| Mean length | 22.53162702 |
| Min length | 7 |
Unique
| Unique | 1057 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Locality extent = 0.05 |
|---|---|
| 2nd row | Locality extent = 95 |
| 3rd row | Locality extent = 3.5 |
| 4th row | Datum Guam 63 |
| 5th row | Locality extent = 1.08 |
| Value | Count | Frequency (%) |
| extent | 134257 | |
| 134207 | ||
| locality | 134203 | |
| mi | 40072 | 6.6% |
| km | 8736 | 1.4% |
| 0.1 | 7251 | 1.2% |
| datum | 6200 | 1.0% |
| 63 | 5497 | 0.9% |
| guam | 5494 | 0.9% |
| 1 | 5323 | 0.9% |
| Other values (2938) | 128798 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 469462 | |
| t | 411232 | |
| e | 269464 | 8.5% |
| i | 175099 | 5.5% |
| . | 149589 | 4.7% |
| a | 146541 | 4.6% |
| l | 134689 | 4.3% |
| n | 134567 | 4.2% |
| o | 134447 | 4.2% |
| y | 134376 | 4.2% |
| Other values (54) | 1007940 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1896549 | |
| Space Separator | 469462 | 14.8% |
| Decimal Number | 368654 | 11.6% |
| Other Punctuation | 149871 | 4.7% |
| Uppercase Letter | 148496 | 4.7% |
| Math Symbol | 134208 | 4.2% |
| Dash Punctuation | 72 | < 0.1% |
| Open Punctuation | 47 | < 0.1% |
| Close Punctuation | 47 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 411232 | |
| e | 269464 | |
| i | 175099 | |
| a | 146541 | 7.7% |
| l | 134689 | 7.1% |
| n | 134567 | 7.1% |
| o | 134447 | 7.1% |
| y | 134376 | 7.1% |
| x | 134300 | 7.1% |
| c | 134263 | 7.1% |
| Other values (14) | 87571 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 134266 | |
| G | 6166 | 4.2% |
| D | 6026 | 4.1% |
| S | 774 | 0.5% |
| W | 687 | 0.5% |
| H | 144 | 0.1% |
| N | 119 | 0.1% |
| P | 107 | 0.1% |
| E | 71 | < 0.1% |
| A | 37 | < 0.1% |
| Other values (9) | 99 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 74829 | |
| 1 | 61579 | |
| 5 | 51221 | |
| 2 | 46439 | |
| 3 | 35147 | |
| 6 | 23996 | 6.5% |
| 4 | 21925 | 5.9% |
| 7 | 21708 | 5.9% |
| 8 | 19177 | 5.2% |
| 9 | 12633 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 149589 | |
| ; | 174 | 0.1% |
| , | 71 | < 0.1% |
| : | 19 | < 0.1% |
| / | 12 | < 0.1% |
| ' | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 469462 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 134208 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 72 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 47 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2045045 | |
| Common | 1122361 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 411232 | |
| e | 269464 | |
| i | 175099 | |
| a | 146541 | 7.2% |
| l | 134689 | 6.6% |
| n | 134567 | 6.6% |
| o | 134447 | 6.6% |
| y | 134376 | 6.6% |
| x | 134300 | 6.6% |
| L | 134266 | 6.6% |
| Other values (33) | 236064 |
Common
| Value | Count | Frequency (%) |
| (space) | 469462 | |
| . | 149589 | 13.3% |
| = | 134208 | 12.0% |
| 0 | 74829 | 6.7% |
| 1 | 61579 | 5.5% |
| 5 | 51221 | 4.6% |
| 2 | 46439 | 4.1% |
| 3 | 35147 | 3.1% |
| 6 | 23996 | 2.1% |
| 4 | 21925 | 2.0% |
| Other values (11) | 53966 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3167406 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 469462 | |
| t | 411232 | |
| e | 269464 | 8.5% |
| i | 175099 | 5.5% |
| . | 149589 | 4.7% |
| a | 146541 | 4.6% |
| l | 134689 | 4.3% |
| n | 134567 | 4.2% |
| o | 134447 | 4.2% |
| y | 134376 | 4.2% |
| Other values (54) | 1007940 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 583784 |
| Missing (%) | 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 3.167865707 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | aff. |
|---|---|
| 2nd row | cf. |
| 3rd row | cf. |
| 4th row | cf. |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| cf | 382 | |
| aff | 28 | 6.7% |
| uncertain | 7 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 438 | |
| . | 410 | |
| c | 389 | |
| a | 35 | 2.6% |
| n | 14 | 1.1% |
| u | 7 | 0.5% |
| e | 7 | 0.5% |
| r | 7 | 0.5% |
| t | 7 | 0.5% |
| i | 7 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 911 | |
| Other Punctuation | 410 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 438 | |
| c | 389 | |
| a | 35 | 3.8% |
| n | 14 | 1.5% |
| u | 7 | 0.8% |
| e | 7 | 0.8% |
| r | 7 | 0.8% |
| t | 7 | 0.8% |
| i | 7 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 410 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 911 | |
| Common | 410 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 438 | |
| c | 389 | |
| a | 35 | 3.8% |
| n | 14 | 1.5% |
| u | 7 | 0.8% |
| e | 7 | 0.8% |
| r | 7 | 0.8% |
| t | 7 | 0.8% |
| i | 7 | 0.8% |
Common
| Value | Count | Frequency (%) |
| . | 410 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1321 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 438 | |
| . | 410 | |
| c | 389 | |
| a | 35 | 2.6% |
| n | 14 | 1.1% |
| u | 7 | 0.5% |
| e | 7 | 0.5% |
| r | 7 | 0.5% |
| t | 7 | 0.5% |
| i | 7 | 0.5% |
typeStatus
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 571070 |
| Missing (%) | 97.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.014698043 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | PARATYPE |
| 3rd row | PARATYPE |
| 4th row | PARATYPE |
| 5th row | PARALECTOTYPE |
| Value | Count | Frequency (%) |
| paratype | 10832 | |
| holotype | 1222 | 9.3% |
| syntype | 835 | 6.4% |
| paralectotype | 208 | 1.6% |
| neotype | 23 | 0.2% |
| lectotype | 11 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 24171 | |
| A | 22080 | |
| Y | 13966 | |
| E | 13373 | |
| T | 13350 | |
| R | 11040 | |
| O | 2686 | 2.6% |
| L | 1441 | 1.4% |
| H | 1222 | 1.2% |
| N | 858 | 0.8% |
| Other values (2) | 1054 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 105241 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 24171 | |
| A | 22080 | |
| Y | 13966 | |
| E | 13373 | |
| T | 13350 | |
| R | 11040 | |
| O | 2686 | 2.6% |
| L | 1441 | 1.4% |
| H | 1222 | 1.2% |
| N | 858 | 0.8% |
| Other values (2) | 1054 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 105241 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 24171 | |
| A | 22080 | |
| Y | 13966 | |
| E | 13373 | |
| T | 13350 | |
| R | 11040 | |
| O | 2686 | 2.6% |
| L | 1441 | 1.4% |
| H | 1222 | 1.2% |
| N | 858 | 0.8% |
| Other values (2) | 1054 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 105241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 24171 | |
| A | 22080 | |
| Y | 13966 | |
| E | 13373 | |
| T | 13350 | |
| R | 11040 | |
| O | 2686 | 2.6% |
| L | 1441 | 1.4% |
| H | 1222 | 1.2% |
| N | 858 | 0.8% |
| Other values (2) | 1054 | 1.0% |
identifiedBy
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 584125 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 18 |
| Mean length | 25.17105263 |
| Min length | 14 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | Gower, David, (BMNH), The Natural History Museum (UNITED KINGDOM) |
|---|---|
| 2nd row | Crombie, Ronald I. |
| 3rd row | Crombie, Ronald I. |
| 4th row | Crombie, Ronald I. |
| 5th row | Crombie, Ronald I. |
| Value | Count | Frequency (%) |
| ronald | 56 | |
| crombie | 55 | |
| i | 55 | |
| natural | 11 | 3.7% |
| history | 11 | 3.7% |
| museum | 11 | 3.7% |
| united | 11 | 3.7% |
| gower | 10 | 3.3% |
| david | 10 | 3.3% |
| bmnh | 10 | 3.3% |
| Other values (26) | 60 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 224 | 11.7% |
| o | 146 | 7.6% |
| e | 102 | 5.3% |
| r | 99 | 5.2% |
| , | 98 | 5.1% |
| a | 95 | 5.0% |
| i | 87 | 4.5% |
| I | 77 | 4.0% |
| n | 73 | 3.8% |
| d | 73 | 3.8% |
| Other values (39) | 839 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1027 | |
| Uppercase Letter | 452 | |
| Space Separator | 224 | 11.7% |
| Other Punctuation | 163 | 8.5% |
| Close Punctuation | 22 | 1.2% |
| Open Punctuation | 22 | 1.2% |
| Dash Punctuation | 3 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 77 | |
| R | 61 | |
| C | 58 | |
| N | 43 | |
| M | 31 | |
| D | 31 | |
| H | 27 | 6.0% |
| G | 24 | 5.3% |
| T | 23 | 5.1% |
| E | 14 | 3.1% |
| Other values (12) | 63 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 146 | |
| e | 102 | |
| r | 99 | |
| a | 95 | |
| i | 87 | |
| n | 73 | |
| d | 73 | |
| l | 69 | |
| m | 68 | |
| b | 56 | 5.5% |
| Other values (11) | 159 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 98 | |
| . | 65 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 224 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1479 | |
| Common | 434 | 22.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 146 | 9.9% |
| e | 102 | 6.9% |
| r | 99 | 6.7% |
| a | 95 | 6.4% |
| i | 87 | 5.9% |
| I | 77 | 5.2% |
| n | 73 | 4.9% |
| d | 73 | 4.9% |
| l | 69 | 4.7% |
| m | 68 | 4.6% |
| Other values (33) | 590 |
Common
| Value | Count | Frequency (%) |
| (space) | 224 | |
| , | 98 | |
| . | 65 | 15.0% |
| ) | 22 | 5.1% |
| ( | 22 | 5.1% |
| - | 3 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1913 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 224 | 11.7% |
| o | 146 | 7.6% |
| e | 102 | 5.3% |
| r | 99 | 5.2% |
| , | 98 | 5.1% |
| a | 95 | 5.0% |
| i | 87 | 4.5% |
| I | 77 | 4.0% |
| n | 73 | 3.8% |
| d | 73 | 3.8% |
| Other values (39) | 839 |
| Distinct | 8475 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.019563472 |
| Min length | 1 |
Unique
| Unique | 1520 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 5225055 |
|---|---|
| 2nd row | 2431506 |
| 3rd row | 5224383 |
| 4th row | 2446249 |
| 5th row | 2467415 |
| Value | Count | Frequency (%) |
| 2431491 | 75714 | 13.0% |
| 2431539 | 13092 | 2.2% |
| 2431224 | 10146 | 1.7% |
| 2431506 | 9986 | 1.7% |
| 2431516 | 8012 | 1.4% |
| 2431529 | 7074 | 1.2% |
| 2431489 | 6103 | 1.0% |
| 2431484 | 5929 | 1.0% |
| 2431219 | 4681 | 0.8% |
| 2431510 | 4614 | 0.8% |
| Other values (8465) | 438850 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4100836 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4100836 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4100836 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
scientificName
Text
| Distinct | 9012 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 182 |
|---|---|
| Median length | 112 |
| Mean length | 35.63831969 |
| Min length | 5 |
Unique
| Unique | 1713 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Carlia bicarinata (Macleay, 1877) |
|---|---|
| 2nd row | Plethodon montanus Highton & Peabody, 2000 |
| 3rd row | Enhydris enhydris (Schneider, 1799) |
| 4th row | Gehyra mutilata (Wiegmann, 1834) |
| 5th row | Anolis richardii Duméril & Bibron, 1837 |
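The sample values above combine a canonical name with its authorship string. A minimal Python sketch of one way to separate the two is given here; the helper name `split_name` and the heuristic (leading genus token plus any following all-lowercase alphabetic tokens form the canonical name) are assumptions made for illustration, not part of the dataset or of the profiling tool. Authors whose names begin with a lowercase particle (for example "de") would be misassigned by this heuristic.

```python
def split_name(scientific_name: str) -> tuple[str, str]:
    """Split a scientificName value into (canonical name, authorship).

    Hypothetical heuristic: the first token is the genus; following tokens
    that are purely alphabetic (hyphens allowed) and lowercase are epithets;
    everything after that is treated as authorship.
    """
    tokens = scientific_name.split()
    if not tokens:
        return "", ""
    canonical = [tokens[0]]
    for tok in tokens[1:]:
        if tok.replace("-", "").isalpha() and tok.islower():
            canonical.append(tok)
        else:
            break
    authorship = " ".join(tokens[len(canonical):])
    return " ".join(canonical), authorship


# Two of the sample rows shown in the table above.
for name in (
    "Carlia bicarinata (Macleay, 1877)",
    "Anolis richardii Duméril & Bibron, 1837",
):
    print(split_name(name))
```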
| Value | Count | Frequency (%) |
| plethodon | 168423 | 6.7% |
| green | 95287 | 3.8% |
| 1818 | 93378 | 3.7% |
| & | 81423 | 3.2% |
| cinereus | 75774 | 3.0% |
| desmognathus | 35846 | 1.4% |
| cope | 33117 | 1.3% |
| duméril | 26833 | 1.1% |
| linnaeus | 26096 | 1.0% |
| bibron | 23820 | 0.9% |
| Other values (8722) | 1859231 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1935027 | 9.3% |
| e | 1567127 | 7.5% |
| o | 1187336 | 5.7% |
| n | 1168671 | 5.6% |
| a | 1154833 | 5.5% |
| i | 1117013 | 5.4% |
| s | 1062494 | 5.1% |
| r | 1046115 | 5.0% |
| t | 861966 | 4.1% |
| l | 827376 | 4.0% |
| Other values (78) | 8891984 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13925691 | |
| Decimal Number | 2300144 | 11.0% |
| Space Separator | 1935027 | 9.3% |
| Uppercase Letter | 1279500 | 6.1% |
| Other Punctuation | 668876 | 3.2% |
| Open Punctuation | 351966 | 1.7% |
| Close Punctuation | 351966 | 1.7% |
| Dash Punctuation | 6772 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1567127 | |
| o | 1187336 | 8.5% |
| n | 1168671 | 8.4% |
| a | 1154833 | 8.3% |
| i | 1117013 | 8.0% |
| s | 1062494 | 7.6% |
| r | 1046115 | 7.5% |
| t | 861966 | 6.2% |
| l | 827376 | 5.9% |
| u | 775008 | 5.6% |
| Other values (32) | 3157752 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 235098 | |
| G | 164331 | |
| D | 110463 | |
| B | 109196 | |
| L | 97183 | |
| S | 85301 | 6.7% |
| H | 85115 | 6.7% |
| C | 81625 | 6.4% |
| A | 65870 | 5.1% |
| E | 36637 | 2.9% |
| Other values (18) | 208681 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 722241 | |
| 8 | 557749 | |
| 9 | 226895 | 9.9% |
| 2 | 147848 | 6.4% |
| 0 | 133680 | 5.8% |
| 5 | 118630 | 5.2% |
| 7 | 113116 | 4.9% |
| 6 | 102351 | 4.4% |
| 3 | 97005 | 4.2% |
| 4 | 80629 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 585483 | |
| & | 81423 | 12.2% |
| . | 1066 | 0.2% |
| ' | 904 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1935027 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 351966 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 351966 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6772 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15205191 | |
| Common | 5614751 | 27.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1567127 | 10.3% |
| o | 1187336 | 7.8% |
| n | 1168671 | 7.7% |
| a | 1154833 | 7.6% |
| i | 1117013 | 7.3% |
| s | 1062494 | 7.0% |
| r | 1046115 | 6.9% |
| t | 861966 | 5.7% |
| l | 827376 | 5.4% |
| u | 775008 | 5.1% |
| Other values (60) | 4437252 |
Common
| Value | Count | Frequency (%) |
| (space) | 1935027 | |
| 1 | 722241 | 12.9% |
| , | 585483 | 10.4% |
| 8 | 557749 | 9.9% |
| ( | 351966 | 6.3% |
| ) | 351966 | 6.3% |
| 9 | 226895 | 4.0% |
| 2 | 147848 | 2.6% |
| 0 | 133680 | 2.4% |
| 5 | 118630 | 2.1% |
| Other values (8) | 483266 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20775558 | |
| None | 44384 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1935027 | 9.3% |
| e | 1567127 | 7.5% |
| o | 1187336 | 5.7% |
| n | 1168671 | 5.6% |
| a | 1154833 | 5.6% |
| i | 1117013 | 5.4% |
| s | 1062494 | 5.1% |
| r | 1046115 | 5.0% |
| t | 861966 | 4.1% |
| l | 827376 | 4.0% |
| Other values (60) | 8847600 |
None
| Value | Count | Frequency (%) |
| é | 29442 | |
| ü | 10886 | 24.5% |
| è | 1680 | 3.8% |
| ö | 1276 | 2.9% |
| Ö | 294 | 0.7% |
| í | 269 | 0.6% |
| ñ | 249 | 0.6% |
| á | 152 | 0.3% |
| ó | 71 | 0.2% |
| å | 20 | < 0.1% |
| Other values (8) | 45 | 0.1% |
| Distinct | 167 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 86 |
|---|---|
| Median length | 82 |
| Mean length | 66.44007265 |
| Min length | 10 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Scincidae, Eugongylinae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Amphibia, Caudata, Plethodontidae |
| 3rd row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Ophidia, Homalopsinae |
| 4th row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Gekkoninae |
| 5th row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Polychrotinae |
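Each sample value above is a comma-separated classification path running from kingdom down to the lowest recorded group. As a rough sketch (the function name `split_classification` is hypothetical, and the variable's heading is missing from this export, so its name is left unstated here), such a path could be split into its components in Python like this:

```python
def split_classification(path: str) -> list[str]:
    """Split a comma-separated classification path into trimmed group names."""
    return [part.strip() for part in path.split(",") if part.strip()]


# First sample row from the table above.
sample = (
    "Animalia, Chordata, Vertebrata, Reptilia, Squamata, "
    "Sauria, Scincidae, Eugongylinae"
)
groups = split_classification(sample)
print(groups[0], groups[-1], len(groups))  # Animalia Eugongylinae 8
```

The paths vary in depth (the second sample row has six components, the first has eight), so positions in the resulting list should not be assumed to map to fixed taxonomic ranks.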
| Value | Count | Frequency (%) |
| animalia | 584195 | |
| vertebrata | 584195 | |
| chordata | 584178 | |
| amphibia | 395159 | |
| caudata | 237127 | |
| plethodontidae | 221369 | 5.9% |
| reptilia | 189036 | 5.1% |
| squamata | 169309 | 4.5% |
| anura | 157511 | 4.2% |
| sauria | 116154 | 3.1% |
| Other values (166) | 484544 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | 8.5% |
| , | 3138578 | 8.1% |
| (space) | 3138578 | 8.1% |
| t | 3000106 | 7.7% |
| e | 2360956 | 6.1% |
| r | 2244920 | 5.8% |
| d | 1648115 | 4.2% |
| h | 1357195 | 3.5% |
| n | 1355739 | 3.5% |
| Other values (36) | 10689615 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28814291 | |
| Uppercase Letter | 3722777 | 9.6% |
| Other Punctuation | 3138578 | 8.1% |
| Space Separator | 3138578 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | |
| t | 3000106 | |
| e | 2360956 | 8.2% |
| r | 2244920 | 7.8% |
| d | 1648115 | 5.7% |
| h | 1357195 | 4.7% |
| n | 1355739 | 4.7% |
| o | 1350378 | 4.7% |
| m | 1224848 | 4.3% |
| Other values (14) | 4391612 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1151930 | |
| C | 876519 | |
| V | 590792 | |
| S | 343033 | 9.2% |
| P | 265039 | 7.1% |
| R | 211930 | 5.7% |
| O | 52750 | 1.4% |
| H | 46430 | 1.2% |
| E | 33840 | 0.9% |
| T | 33424 | 0.9% |
| Other values (10) | 117090 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3138578 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3138578 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32537068 | |
| Common | 6277156 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | |
| t | 3000106 | 9.2% |
| e | 2360956 | 7.3% |
| r | 2244920 | 6.9% |
| d | 1648115 | 5.1% |
| h | 1357195 | 4.2% |
| n | 1355739 | 4.2% |
| o | 1350378 | 4.2% |
| m | 1224848 | 3.8% |
| Other values (34) | 8114389 |
Common
| Value | Count | Frequency (%) |
| , | 3138578 | |
| (space) | 3138578 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38814224 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | 8.5% |
| , | 3138578 | 8.1% |
| (space) | 3138578 | 8.1% |
| t | 3000106 | 7.7% |
| e | 2360956 | 6.1% |
| r | 2244920 | 5.8% |
| d | 1648115 | 4.2% |
| h | 1357195 | 3.5% |
| n | 1355739 | 3.5% |
| Other values (36) | 10689615 |
kingdom
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1168402 | |
| a | 1168402 | |
| A | 584201 | |
| n | 584201 | |
| m | 584201 | |
| l | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4089407 | |
| Uppercase Letter | 584201 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1168402 | |
| a | 1168402 | |
| n | 584201 | |
| m | 584201 | |
| l | 584201 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4673608 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1168402 | |
| a | 1168402 | |
| A | 584201 | |
| n | 584201 | |
| m | 584201 | |
| l | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4673608 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1168402 | |
| a | 1168402 | |
| A | 584201 | |
| n | 584201 | |
| m | 584201 | |
| l | 584201 |
phylum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 584196 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1168392 | |
| C | 584196 | |
| h | 584196 | |
| o | 584196 | |
| r | 584196 | |
| d | 584196 | |
| t | 584196 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4089372 | |
| Uppercase Letter | 584196 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1168392 | |
| h | 584196 | |
| o | 584196 | |
| r | 584196 | |
| d | 584196 | |
| t | 584196 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 584196 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4673568 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1168392 | |
| C | 584196 | |
| h | 584196 | |
| o | 584196 | |
| r | 584196 | |
| d | 584196 | |
| t | 584196 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4673568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1168392 | |
| C | 584196 | |
| h | 584196 | |
| o | 584196 | |
| r | 584196 | |
| d | 584196 | |
| t | 584196 |
class
Text
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 203 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 8.067606396 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Squamata |
|---|---|
| 2nd row | Amphibia |
| 3rd row | Squamata |
| 4th row | Squamata |
| 5th row | Squamata |
| Value | Count | Frequency (%) |
| amphibia | 395161 | 67.7% |
| squamata | 169110 | 29.0% |
| testudines | 18909 | 3.2% |
| crocodylia | 804 | 0.1% |
| sphenodontia | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 903309 | |
| i | 810049 | |
| m | 564271 | |
| p | 395175 | |
| h | 395175 | |
| A | 395161 | |
| b | 395161 | |
| t | 188033 | 4.0% |
| u | 188019 | 4.0% |
| S | 169124 | 3.6% |
| Other values (12) | 307989 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4127468 | |
| Uppercase Letter | 583998 | 12.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 903309 | |
| i | 810049 | |
| m | 564271 | |
| p | 395175 | |
| h | 395175 | |
| b | 395161 | |
| t | 188033 | 4.6% |
| u | 188019 | 4.6% |
| q | 169110 | 4.1% |
| e | 37832 | 0.9% |
| Other values (8) | 81334 | 2.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 395161 | |
| S | 169124 | |
| T | 18909 | 3.2% |
| C | 804 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4711466 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 903309 | |
| i | 810049 | |
| m | 564271 | |
| p | 395175 | |
| h | 395175 | |
| A | 395161 | |
| b | 395161 | |
| t | 188033 | 4.0% |
| u | 188019 | 4.0% |
| S | 169124 | 3.6% |
| Other values (12) | 307989 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4711466 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 903309 | |
| i | 810049 | |
| m | 564271 | |
| p | 395175 | |
| h | 395175 | |
| A | 395161 | |
| b | 395161 | |
| t | 188033 | 4.0% |
| u | 188019 | 4.0% |
| S | 169124 | 3.6% |
| Other values (12) | 307989 | 6.5% |
order
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 189040 |
| Missing (%) | 32.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 6.208074683 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Caudata |
|---|---|
| 2nd row | Caudata |
| 3rd row | Anura |
| 4th row | Anura |
| 5th row | Caudata |
| Value | Count | Frequency (%) |
| caudata | 237129 | 60.0% |
| anura | 157511 | 39.9% |
| gymnophiona | 521 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 869419 | |
| u | 394640 | |
| C | 237129 | 9.7% |
| d | 237129 | 9.7% |
| t | 237129 | 9.7% |
| n | 158553 | 6.5% |
| A | 157511 | 6.4% |
| r | 157511 | 6.4% |
| o | 1042 | < 0.1% |
| G | 521 | < 0.1% |
| Other values (5) | 2605 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2058028 | |
| Uppercase Letter | 395161 | 16.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 869419 | |
| u | 394640 | |
| d | 237129 | 11.5% |
| t | 237129 | 11.5% |
| n | 158553 | 7.7% |
| r | 157511 | 7.7% |
| o | 1042 | 0.1% |
| y | 521 | < 0.1% |
| m | 521 | < 0.1% |
| p | 521 | < 0.1% |
| Other values (2) | 1042 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 237129 | |
| A | 157511 | |
| G | 521 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2453189 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 869419 | |
| u | 394640 | |
| C | 237129 | 9.7% |
| d | 237129 | 9.7% |
| t | 237129 | 9.7% |
| n | 158553 | 6.5% |
| A | 157511 | 6.4% |
| r | 157511 | 6.4% |
| o | 1042 | < 0.1% |
| G | 521 | < 0.1% |
| Other values (5) | 2605 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2453189 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 869419 | |
| u | 394640 | |
| C | 237129 | 9.7% |
| d | 237129 | 9.7% |
| t | 237129 | 9.7% |
| n | 158553 | 6.5% |
| A | 157511 | 6.4% |
| r | 157511 | 6.4% |
| o | 1042 | < 0.1% |
| G | 521 | < 0.1% |
| Other values (5) | 2605 | 0.1% |
family
Text
| Distinct | 159 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 587 |
| Missing (%) | 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 19 |
| Mean length | 12.00749468 |
| Min length | 6 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Scincidae |
|---|---|
| 2nd row | Plethodontidae |
| 3rd row | Homalopsidae |
| 4th row | Gekkonidae |
| 5th row | Dactyloidae |
| Value | Count | Frequency (%) |
| plethodontidae | 221371 | |
| hylidae | 41566 | 7.1% |
| colubridae | 38793 | 6.6% |
| scincidae | 26153 | 4.5% |
| bufonidae | 25125 | 4.3% |
| ranidae | 20333 | 3.5% |
| dactyloidae | 18373 | 3.1% |
| gekkonidae | 17255 | 3.0% |
| phrynosomatidae | 16259 | 2.8% |
| leptodactylidae | 10435 | 1.8% |
| Other values (149) | 147951 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 917602 | |
| d | 865810 | |
| a | 768920 | |
| o | 698920 | |
| i | 662199 | |
| t | 582781 | |
| l | 411564 | 5.9% |
| n | 371028 | 5.3% |
| h | 296650 | 4.2% |
| P | 246994 | 3.5% |
| Other values (32) | 1185274 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6424128 | |
| Uppercase Letter | 583614 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 917602 | |
| d | 865810 | |
| a | 768920 | |
| o | 698920 | |
| i | 662199 | |
| t | 582781 | |
| l | 411564 | |
| n | 371028 | |
| h | 296650 | 4.6% |
| r | 160765 | 2.5% |
| Other values (12) | 687889 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 246994 | |
| C | 61224 | 10.5% |
| H | 46641 | 8.0% |
| S | 42563 | 7.3% |
| D | 32211 | 5.5% |
| B | 27193 | 4.7% |
| R | 22885 | 3.9% |
| E | 20369 | 3.5% |
| G | 20274 | 3.5% |
| A | 16289 | 2.8% |
| Other values (10) | 46971 | 8.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7007742 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 917602 | |
| d | 865810 | |
| a | 768920 | |
| o | 698920 | |
| i | 662199 | |
| t | 582781 | |
| l | 411564 | 5.9% |
| n | 371028 | 5.3% |
| h | 296650 | 4.2% |
| P | 246994 | 3.5% |
| Other values (32) | 1185274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7007742 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 917602 | |
| d | 865810 | |
| a | 768920 | |
| o | 698920 | |
| i | 662199 | |
| t | 582781 | |
| l | 411564 | 5.9% |
| n | 371028 | 5.3% |
| h | 296650 | 4.2% |
| P | 246994 | 3.5% |
| Other values (32) | 1185274 |
genus
Text
| Distinct | 1416 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1685 |
| Missing (%) | 0.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 9.556288583 |
| Min length | 3 |
Unique
| Unique | 97 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Carlia |
|---|---|
| 2nd row | Plethodon |
| 3rd row | Enhydris |
| 4th row | Gehyra |
| 5th row | Anolis |
| Value | Count | Frequency (%) |
| plethodon | 168423 | |
| desmognathus | 35846 | 6.2% |
| anolis | 18373 | 3.2% |
| lithobates | 12991 | 2.2% |
| eleutherodactylus | 9948 | 1.7% |
| anaxyrus | 9476 | 1.6% |
| sceloporus | 8824 | 1.5% |
| emoia | 8233 | 1.4% |
| eurycea | 7667 | 1.3% |
| pseudacris | 6800 | 1.2% |
| Other values (1406) | 295935 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 686047 | |
| e | 456061 | 8.2% |
| t | 411539 | 7.4% |
| s | 401007 | 7.2% |
| l | 372079 | 6.7% |
| h | 362493 | 6.5% |
| a | 356491 | 6.4% |
| n | 349007 | 6.3% |
| d | 268142 | 4.8% |
| i | 230506 | 4.1% |
| Other values (42) | 1673319 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4984175 | |
| Uppercase Letter | 582516 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 686047 | |
| e | 456061 | |
| t | 411539 | 8.3% |
| s | 401007 | 8.0% |
| l | 372079 | 7.5% |
| h | 362493 | 7.3% |
| a | 356491 | 7.2% |
| n | 349007 | 7.0% |
| d | 268142 | 5.4% |
| i | 230506 | 4.6% |
| Other values (16) | 1090803 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 209915 | |
| A | 57202 | 9.8% |
| D | 53448 | 9.2% |
| L | 37037 | 6.4% |
| S | 34448 | 5.9% |
| E | 33842 | 5.8% |
| C | 32105 | 5.5% |
| H | 19213 | 3.3% |
| T | 18000 | 3.1% |
| B | 13873 | 2.4% |
| Other values (16) | 73433 | 12.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5566691 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 686047 | |
| e | 456061 | 8.2% |
| t | 411539 | 7.4% |
| s | 401007 | 7.2% |
| l | 372079 | 6.7% |
| h | 362493 | 6.5% |
| a | 356491 | 6.4% |
| n | 349007 | 6.3% |
| d | 268142 | 4.8% |
| i | 230506 | 4.1% |
| Other values (42) | 1673319 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5566691 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 686047 | |
| e | 456061 | 8.2% |
| t | 411539 | 7.4% |
| s | 401007 | 7.2% |
| l | 372079 | 6.7% |
| h | 362493 | 6.5% |
| a | 356491 | 6.4% |
| n | 349007 | 6.3% |
| d | 268142 | 4.8% |
| i | 230506 | 4.1% |
| Other values (42) | 1673319 |
genericName
Text
| Distinct | 1357 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1685 |
| Missing (%) | 0.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 9.513697478 |
| Min length | 3 |
Unique
| Unique | 124 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Carlia |
|---|---|
| 2nd row | Plethodon |
| 3rd row | Enhydris |
| 4th row | Gehyra |
| 5th row | Anolis |
| Value | Count | Frequency (%) |
| plethodon | 168423 | |
| desmognathus | 35846 | 6.2% |
| anolis | 18333 | 3.1% |
| lithobates | 12991 | 2.2% |
| eleutherodactylus | 9947 | 1.7% |
| anaxyrus | 9476 | 1.6% |
| sceloporus | 8824 | 1.5% |
| emoia | 8211 | 1.4% |
| eurycea | 7626 | 1.3% |
| pseudacris | 6800 | 1.2% |
| Other values (1347) | 296039 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 674207 | |
| e | 453829 | 8.2% |
| t | 411395 | 7.4% |
| s | 399277 | 7.2% |
| l | 371549 | 6.7% |
| a | 364650 | 6.6% |
| h | 357431 | 6.4% |
| n | 345478 | 6.2% |
| d | 268371 | 4.8% |
| i | 236254 | 4.3% |
| Other values (41) | 1659440 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4959365 | |
| Uppercase Letter | 582516 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 674207 | |
| e | 453829 | |
| t | 411395 | 8.3% |
| s | 399277 | 8.1% |
| l | 371549 | 7.5% |
| a | 364650 | 7.4% |
| h | 357431 | 7.2% |
| n | 345478 | 7.0% |
| d | 268371 | 5.4% |
| i | 236254 | 4.8% |
| Other values (16) | 1076924 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 210243 | |
| A | 59414 | 10.2% |
| D | 48886 | 8.4% |
| L | 39047 | 6.7% |
| E | 33655 | 5.8% |
| S | 33086 | 5.7% |
| C | 32219 | 5.5% |
| H | 26221 | 4.5% |
| T | 17245 | 3.0% |
| R | 13689 | 2.3% |
| Other values (15) | 68811 | 11.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5541881 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 674207 | |
| e | 453829 | 8.2% |
| t | 411395 | 7.4% |
| s | 399277 | 7.2% |
| l | 371549 | 6.7% |
| a | 364650 | 6.6% |
| h | 357431 | 6.4% |
| n | 345478 | 6.2% |
| d | 268371 | 4.8% |
| i | 236254 | 4.3% |
| Other values (41) | 1659440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5541881 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 674207 | |
| e | 453829 | 8.2% |
| t | 411395 | 7.4% |
| s | 399277 | 7.2% |
| l | 371549 | 6.7% |
| a | 364650 | 6.6% |
| h | 357431 | 6.4% |
| n | 345478 | 6.2% |
| d | 268371 | 4.8% |
| i | 236254 | 4.3% |
| Other values (41) | 1659440 |
specificEpithet
Text
Missing 
| Distinct | 5069 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 15011 |
| Missing (%) | 2.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 8.818503487 |
| Min length | 3 |
Unique
| Unique | 752 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | bicarinata |
|---|---|
| 2nd row | montanus |
| 3rd row | enhydris |
| 4th row | mutilata |
| 5th row | richardii |
| Value | Count | Frequency (%) |
| cinereus | 75774 | 13.3% |
| glutinosus | 13098 | 2.3% |
| fuscus | 10996 | 1.9% |
| montanus | 10396 | 1.8% |
| jordani | 8582 | 1.5% |
| metcalfi | 6940 | 1.2% |
| cylindraceus | 6103 | 1.1% |
| carolinensis | 5850 | 1.0% |
| teyahalee | 5559 | 1.0% |
| septentrionalis | 4872 | 0.9% |
| Other values (5059) | 421020 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 543995 | |
| s | 515115 | |
| e | 488200 | |
| a | 483269 | |
| r | 401688 | |
| u | 396785 | |
| n | 359600 | 7.2% |
| c | 306019 | 6.1% |
| t | 278310 | 5.5% |
| o | 259596 | 5.2% |
| Other values (17) | 986827 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5018843 | |
| Dash Punctuation | 561 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 543995 | |
| s | 515115 | |
| e | 488200 | |
| a | 483269 | |
| r | 401688 | |
| u | 396785 | |
| n | 359600 | 7.2% |
| c | 306019 | 6.1% |
| t | 278310 | 5.5% |
| o | 259596 | 5.2% |
| Other values (16) | 986266 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5018843 | |
| Common | 561 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 543995 | |
| s | 515115 | |
| e | 488200 | |
| a | 483269 | |
| r | 401688 | |
| u | 396785 | |
| n | 359600 | 7.2% |
| c | 306019 | 6.1% |
| t | 278310 | 5.5% |
| o | 259596 | 5.2% |
| Other values (16) | 986266 |
Common
| Value | Count | Frequency (%) |
| - | 561 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5019404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 543995 | |
| s | 515115 | |
| e | 488200 | |
| a | 483269 | |
| r | 401688 | |
| u | 396785 | |
| n | 359600 | 7.2% |
| c | 306019 | 6.1% |
| t | 278310 | 5.5% |
| o | 259596 | 5.2% |
| Other values (17) | 986827 |
Missing 
| Distinct | 1214 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 559230 |
| Missing (%) | 95.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 9.070041248 |
| Min length | 3 |
Unique
| Unique | 244 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | occidentalis |
|---|---|
| 2nd row | consobrinus |
| 3rd row | trinidadensis |
| 4th row | ignigularis |
| 5th row | metcalfi |
| Value | Count | Frequency (%) |
| viridescens | 1460 | 5.8% |
| blanchardi | 1205 | 4.8% |
| metcalfi | 1072 | 4.3% |
| fasciata | 1043 | 4.2% |
| elegans | 909 | 3.6% |
| stejnegeri | 388 | 1.6% |
| teyahalee | 370 | 1.5% |
| louisianensis | 365 | 1.5% |
| dorsalis | 340 | 1.4% |
| fuscus | 318 | 1.3% |
| Other values (1204) | 17501 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 26514 | |
| i | 26373 | |
| s | 22393 | |
| e | 19932 | 8.8% |
| n | 15025 | 6.6% |
| r | 14301 | 6.3% |
| l | 14245 | 6.3% |
| t | 12449 | 5.5% |
| c | 12257 | 5.4% |
| u | 11424 | 5.0% |
| Other values (16) | 51575 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 226488 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 26514 | |
| i | 26373 | |
| s | 22393 | |
| e | 19932 | 8.8% |
| n | 15025 | 6.6% |
| r | 14301 | 6.3% |
| l | 14245 | 6.3% |
| t | 12449 | 5.5% |
| c | 12257 | 5.4% |
| u | 11424 | 5.0% |
| Other values (16) | 51575 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 226488 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 26514 | |
| i | 26373 | |
| s | 22393 | |
| e | 19932 | 8.8% |
| n | 15025 | 6.6% |
| r | 14301 | 6.3% |
| l | 14245 | 6.3% |
| t | 12449 | 5.5% |
| c | 12257 | 5.4% |
| u | 11424 | 5.0% |
| Other values (16) | 51575 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 226488 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 26514 | |
| i | 26373 | |
| s | 22393 | |
| e | 19932 | 8.8% |
| n | 15025 | 6.6% |
| r | 14301 | 6.3% |
| l | 14245 | 6.3% |
| t | 12449 | 5.5% |
| c | 12257 | 5.4% |
| u | 11424 | 5.0% |
| Other values (16) | 51575 |
taxonRank
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.079066965 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 544219 | 93.2% |
| subspecies | 24970 | 4.3% |
| genus | 13326 | 2.3% |
| family | 1101 | 0.2% |
| order | 379 | 0.1% |
| phylum | 198 | < 0.1% |
| class | 5 | < 0.1% |
| kingdom | 2 | < 0.1% |
| variety | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1176684 | |
| E | 1152084 | |
| I | 570293 | |
| P | 569387 | |
| C | 569194 | |
| U | 38494 | 0.9% |
| B | 24970 | 0.6% |
| G | 13328 | 0.3% |
| N | 13328 | 0.3% |
| L | 1304 | < 0.1% |
| Other values (11) | 6532 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4135598 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1176684 | |
| E | 1152084 | |
| I | 570293 | |
| P | 569387 | |
| C | 569194 | |
| U | 38494 | 0.9% |
| B | 24970 | 0.6% |
| G | 13328 | 0.3% |
| N | 13328 | 0.3% |
| L | 1304 | < 0.1% |
| Other values (11) | 6532 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4135598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1176684 | |
| E | 1152084 | |
| I | 570293 | |
| P | 569387 | |
| C | 569194 | |
| U | 38494 | 0.9% |
| B | 24970 | 0.6% |
| G | 13328 | 0.3% |
| N | 13328 | 0.3% |
| L | 1304 | < 0.1% |
| Other values (11) | 6532 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4135598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1176684 | |
| E | 1152084 | |
| I | 570293 | |
| P | 569387 | |
| C | 569194 | |
| U | 38494 | 0.9% |
| B | 24970 | 0.6% |
| G | 13328 | 0.3% |
| N | 13328 | 0.3% |
| L | 1304 | < 0.1% |
| Other values (11) | 6532 | 0.2% |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.914724555 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 534360 | 91.5% |
| synonym | 49818 | 8.5% |
| doubtful | 23 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1068720 | |
| E | 1068720 | |
| T | 534383 | |
| D | 534383 | |
| A | 534360 | |
| P | 534360 | |
| Y | 99636 | 2.2% |
| N | 99636 | 2.2% |
| O | 49841 | 1.1% |
| S | 49818 | 1.1% |
| Other values (5) | 49933 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4623790 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1068720 | |
| E | 1068720 | |
| T | 534383 | |
| D | 534383 | |
| A | 534360 | |
| P | 534360 | |
| Y | 99636 | 2.2% |
| N | 99636 | 2.2% |
| O | 49841 | 1.1% |
| S | 49818 | 1.1% |
| Other values (5) | 49933 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4623790 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 1068720 | |
| E | 1068720 | |
| T | 534383 | |
| D | 534383 | |
| A | 534360 | |
| P | 534360 | |
| Y | 99636 | 2.2% |
| N | 99636 | 2.2% |
| O | 49841 | 1.1% |
| S | 49818 | 1.1% |
| Other values (5) | 49933 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4623790 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1068720 | |
| E | 1068720 | |
| T | 534383 | |
| D | 534383 | |
| A | 534360 | |
| P | 534360 | |
| Y | 99636 | 2.2% |
| N | 99636 | 2.2% |
| O | 49841 | 1.1% |
| S | 49818 | 1.1% |
| Other values (5) | 49933 | 1.1% |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2336804 | |
| a | 2336804 | |
| - | 2336804 | |
| 2 | 1752603 | |
| b | 1752603 | |
| 4 | 1752603 | |
| 8 | 1168402 | 5.6% |
| 3 | 1168402 | 5.6% |
| 5 | 1168402 | 5.6% |
| 9 | 1168402 | 5.6% |
| Other values (6) | 4089407 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10515618 | |
| Lowercase Letter | 8178814 | |
| Dash Punctuation | 2336804 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1752603 | |
| 4 | 1752603 | |
| 8 | 1168402 | |
| 3 | 1168402 | |
| 5 | 1168402 | |
| 9 | 1168402 | |
| 1 | 584201 | 5.6% |
| 7 | 584201 | 5.6% |
| 0 | 584201 | 5.6% |
| 6 | 584201 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2336804 | |
| a | 2336804 | |
| b | 1752603 | |
| d | 1168402 | |
| e | 584201 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2336804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12852422 | |
| Latin | 8178814 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2336804 | |
| 2 | 1752603 | |
| 4 | 1752603 | |
| 8 | 1168402 | |
| 3 | 1168402 | |
| 5 | 1168402 | |
| 9 | 1168402 | |
| 1 | 584201 | 4.5% |
| 7 | 584201 | 4.5% |
| 0 | 584201 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 2336804 | |
| a | 2336804 | |
| b | 1752603 | |
| d | 1168402 | |
| e | 584201 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21031236 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2336804 | |
| a | 2336804 | |
| - | 2336804 | |
| 2 | 1752603 | |
| b | 1752603 | |
| 4 | 1752603 | |
| 8 | 1168402 | 5.6% |
| 3 | 1168402 | 5.6% |
| 5 | 1168402 | 5.6% |
| 9 | 1168402 | 5.6% |
| Other values (6) | 4089407 |
publishingCountry
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1168402 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1168402 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1168402 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 |
lastInterpreted
Text
| Distinct | 186736 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99567957 |
| Min length | 20 |
Unique
| Unique | 40383 |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 2024-12-02T13:56:06.739Z |
|---|---|
| 2nd row | 2024-12-02T13:56:08.224Z |
| 3rd row | 2024-12-02T13:55:56.801Z |
| 4th row | 2024-12-02T13:59:51.499Z |
| 5th row | 2024-12-02T13:58:04.592Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:45.601z | 17 | < 0.1% |
| 2024-12-02t13:57:52.847z | 16 | < 0.1% |
| 2024-12-02t13:57:54.221z | 16 | < 0.1% |
| 2024-12-02t13:57:23.249z | 16 | < 0.1% |
| 2024-12-02t13:57:51.135z | 16 | < 0.1% |
| 2024-12-02t13:57:50.745z | 15 | < 0.1% |
| 2024-12-02t13:58:01.663z | 15 | < 0.1% |
| 2024-12-02t13:56:52.538z | 15 | < 0.1% |
| 2024-12-02t13:57:30.398z | 15 | < 0.1% |
| 2024-12-02t13:57:53.169z | 15 | < 0.1% |
| Other values (186726) | 584045 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| - | 1168402 | |
| : | 1168402 | |
| 4 | 939301 | 6.7% |
| 5 | 927875 | 6.6% |
| 3 | 926225 | 6.6% |
| T | 584201 | 4.2% |
| Z | 584201 | 4.2% |
| Other values (5) | 2098000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9929524 | |
| Other Punctuation | 1751972 | 12.5% |
| Dash Punctuation | 1168402 | 8.3% |
| Uppercase Letter | 1168402 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| 4 | 939301 | 9.5% |
| 5 | 927875 | 9.3% |
| 3 | 926225 | 9.3% |
| 7 | 448284 | 4.5% |
| 9 | 373157 | 3.8% |
| 6 | 352898 | 3.6% |
| 8 | 340091 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 | |
| . | 583570 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12849898 | |
| Latin | 1168402 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| - | 1168402 | |
| : | 1168402 | |
| 4 | 939301 | 7.3% |
| 5 | 927875 | 7.2% |
| 3 | 926225 | 7.2% |
| . | 583570 | 4.5% |
| 7 | 448284 | 3.5% |
| Other values (3) | 1066146 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14018300 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| - | 1168402 | |
| : | 1168402 | |
| 4 | 939301 | 6.7% |
| 5 | 927875 | 6.6% |
| 3 | 926225 | 6.6% |
| T | 584201 | 4.2% |
| Z | 584201 | 4.2% |
| Other values (5) | 2098000 |
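The length statistics above (minimum 20, maximum 24) and the count of `.` characters (583570, fewer than the 584201 records) suggest that a small number of lastInterpreted values omit the millisecond part. The following is a minimal sketch of a parser that tolerates both shapes. It assumes the shorter values look like `2024-12-02T13:56:06Z`; the report shows no such sample, so that form is an assumption, and the helper name `parse_last_interpreted` is hypothetical.

```python
from datetime import datetime, timezone

# Candidate layouts: with and without fractional seconds.
# The second layout is an assumption inferred from the minimum length of 20.
_FORMATS = ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ")


def parse_last_interpreted(value: str) -> datetime:
    """Parse a UTC timestamp string, trying each known layout in turn."""
    for fmt in _FORMATS:
        try:
            # The trailing "Z" means UTC, so attach the UTC timezone explicitly.
            return datetime.strptime(value, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue
    raise ValueError(f"unrecognized timestamp: {value!r}")
```

Trying the layouts in order keeps the function strict: anything that matches neither shape raises an error instead of being silently coerced.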
elevation
Text
Missing 
| Distinct | 1604 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 332110 |
| Missing (%) | 56.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.180430876 |
| Min length | 3 |
Unique
| Unique | 190 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1317.0 |
|---|---|
| 2nd row | 1326.0 |
| 3rd row | 2200.0 |
| 4th row | 40.0 |
| 5th row | 9.0 |
| Value | Count | Frequency (%) |
| 1067.0 | 4286 | 1.7% |
| 373.0 | 4059 | 1.6% |
| 1036.0 | 2829 | 1.1% |
| 200.0 | 2818 | 1.1% |
| 3.0 | 2315 | 0.9% |
| 280.0 | 2242 | 0.9% |
| 6.0 | 2149 | 0.9% |
| 174.0 | 2077 | 0.8% |
| 1146.0 | 2023 | 0.8% |
| 152.0 | 2023 | 0.8% |
| Other values (1591) | 225270 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 353001 | |
| . | 252091 | |
| 1 | 176370 | |
| 2 | 80512 | 6.2% |
| 3 | 77490 | 5.9% |
| 5 | 76790 | 5.9% |
| 4 | 63616 | 4.9% |
| 7 | 61821 | 4.7% |
| 6 | 60632 | 4.6% |
| 9 | 53647 | 4.1% |
| Other values (2) | 49970 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1053844 | |
| Other Punctuation | 252091 | 19.3% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 353001 | |
| 1 | 176370 | |
| 2 | 80512 | 7.6% |
| 3 | 77490 | 7.4% |
| 5 | 76790 | 7.3% |
| 4 | 63616 | 6.0% |
| 7 | 61821 | 5.9% |
| 6 | 60632 | 5.8% |
| 9 | 53647 | 5.1% |
| 8 | 49965 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 252091 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1305940 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 353001 | |
| . | 252091 | |
| 1 | 176370 | |
| 2 | 80512 | 6.2% |
| 3 | 77490 | 5.9% |
| 5 | 76790 | 5.9% |
| 4 | 63616 | 4.9% |
| 7 | 61821 | 4.7% |
| 6 | 60632 | 4.6% |
| 9 | 53647 | 4.1% |
| Other values (2) | 49970 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1305940 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 353001 | |
| . | 252091 | |
| 1 | 176370 | |
| 2 | 80512 | 6.2% |
| 3 | 77490 | 5.9% |
| 5 | 76790 | 5.9% |
| 4 | 63616 | 4.9% |
| 7 | 61821 | 4.7% |
| 6 | 60632 | 4.6% |
| 9 | 53647 | 4.1% |
| Other values (2) | 49970 | 3.8% |
elevationAccuracy
Text
Missing
| Distinct | 136 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 333288 |
| Missing (%) | 57.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 3 |
| Mean length | 3.118419532 |
| Min length | 3 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 10.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 220652 | 87.9% |
| 38.0 | 4866 | 1.9% |
| 30.5 | 2706 | 1.1% |
| 15.0 | 2411 | 1.0% |
| 18.0 | 1878 | 0.7% |
| 20.0 | 1562 | 0.6% |
| 15.5 | 1561 | 0.6% |
| 61.0 | 1329 | 0.5% |
| 26.0 | 989 | 0.4% |
| 12.0 | 842 | 0.3% |
| Other values (126) | 12117 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 469334 | |
| . | 250913 | |
| 5 | 16435 | 2.1% |
| 1 | 11282 | 1.4% |
| 3 | 10235 | 1.3% |
| 8 | 7880 | 1.0% |
| 2 | 7575 | 1.0% |
| 6 | 3460 | 0.4% |
| 4 | 2449 | 0.3% |
| 7 | 1564 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 531539 | |
| Other Punctuation | 250913 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 469334 | |
| 5 | 16435 | 3.1% |
| 1 | 11282 | 2.1% |
| 3 | 10235 | 1.9% |
| 8 | 7880 | 1.5% |
| 2 | 7575 | 1.4% |
| 6 | 3460 | 0.7% |
| 4 | 2449 | 0.5% |
| 7 | 1564 | 0.3% |
| 9 | 1325 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 250913 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 782452 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 469334 | |
| . | 250913 | |
| 5 | 16435 | 2.1% |
| 1 | 11282 | 1.4% |
| 3 | 10235 | 1.3% |
| 8 | 7880 | 1.0% |
| 2 | 7575 | 1.0% |
| 6 | 3460 | 0.4% |
| 4 | 2449 | 0.3% |
| 7 | 1564 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 782452 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 469334 | |
| . | 250913 | |
| 5 | 16435 | 2.1% |
| 1 | 11282 | 1.4% |
| 3 | 10235 | 1.3% |
| 8 | 7880 | 1.0% |
| 2 | 7575 | 1.0% |
| 6 | 3460 | 0.4% |
| 4 | 2449 | 0.3% |
| 7 | 1564 | 0.2% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 146 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 581727 |
| Missing (%) | 99.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.05052546 |
| Min length | 3 |
Unique
| Unique | 48 |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | 818.1211019658687 |
|---|---|
| 2nd row | 4856.291022878801 |
| 3rd row | 1710.7413076448918 |
| 4th row | 3977.2558796326234 |
| 5th row | 4961.494346970892 |
| Value | Count | Frequency (%) |
| 2063.191632254214 | 334 | 13.5% |
| 4961.494346970892 | 245 | 9.9% |
| 1710.7413076448918 | 132 | 5.3% |
| 4852.601362825603 | 128 | 5.2% |
| 818.1211019658687 | 83 | 3.4% |
| 4878.72894658956 | 83 | 3.4% |
| 2259.882955420656 | 80 | 3.2% |
| 1971.0139476565842 | 69 | 2.8% |
| 3977.2558796326234 | 55 | 2.2% |
| 4128.7637113665405 | 53 | 2.1% |
| Other values (136) | 1212 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 5219 | |
| 2 | 4705 | |
| 1 | 4367 | |
| 6 | 4167 | |
| 3 | 3976 | |
| 8 | 3849 | |
| 9 | 3792 | |
| 5 | 3395 | |
| 7 | 3367 | |
| 0 | 2872 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 39709 | |
| Other Punctuation | 2474 | 5.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 5219 | |
| 2 | 4705 | |
| 1 | 4367 | |
| 6 | 4167 | |
| 3 | 3976 | |
| 8 | 3849 | |
| 9 | 3792 | |
| 5 | 3395 | |
| 7 | 3367 | |
| 0 | 2872 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2474 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42183 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 5219 | |
| 2 | 4705 | |
| 1 | 4367 | |
| 6 | 4167 | |
| 3 | 3976 | |
| 8 | 3849 | |
| 9 | 3792 | |
| 5 | 3395 | |
| 7 | 3367 | |
| 0 | 2872 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42183 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 5219 | |
| 2 | 4705 | |
| 1 | 4367 | |
| 6 | 4167 | |
| 3 | 3976 | |
| 8 | 3849 | |
| 9 | 3792 | |
| 5 | 3395 | |
| 7 | 3367 | |
| 0 | 2872 |
issue
Text
| Distinct | 165 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 186 |
|---|---|
| Median length | 179 |
| Mean length | 68.60599385 |
| Min length | 28 |
Unique
| Unique | 31 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84 |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84 |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;CONTINENT_INVALID |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84 |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 249666 | 42.7% |
| occurrence_status_inferred_from_individual_count | 227050 | 38.9% |
| occurrence_status_inferred_from_individual_count;coordinate_reprojected | 34842 | 6.0% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid | 12098 | 2.1% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 9004 | 1.5% |
| occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;country_invalid;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 6887 | 1.2% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 6809 | 1.2% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 5451 | 0.9% |
| occurrence_status_inferred_from_individual_count;country_invalid | 5419 | 0.9% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;taxon_match_higherrank | 4662 | 0.8% |
| Other values (155) | 22311 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 4053596 | |
| E | 3553564 | 8.9% |
| R | 3202871 | 8.0% |
| U | 2974631 | 7.4% |
| I | 2922553 | 7.3% |
| D | 2881308 | 7.2% |
| C | 2865419 | 7.1% |
| N | 2681132 | 6.7% |
| T | 2652470 | 6.6% |
| O | 2377822 | 5.9% |
| Other values (19) | 9914187 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 35006044 | |
| Connector Punctuation | 4053596 | 10.1% |
| Decimal Number | 577858 | 1.4% |
| Other Punctuation | 442055 | 1.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3553564 | |
| R | 3202871 | |
| U | 2974631 | |
| I | 2922553 | |
| D | 2881308 | |
| C | 2865419 | |
| N | 2681132 | 7.7% |
| T | 2652470 | 7.6% |
| O | 2377822 | 6.8% |
| S | 2077060 | 5.9% |
| Other values (15) | 6817214 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 288929 | |
| 4 | 288929 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4053596 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 442055 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 35006044 | |
| Common | 5073509 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3553564 | |
| R | 3202871 | |
| U | 2974631 | |
| I | 2922553 | |
| D | 2881308 | |
| C | 2865419 | |
| N | 2681132 | 7.7% |
| T | 2652470 | 7.6% |
| O | 2377822 | 6.8% |
| S | 2077060 | 5.9% |
| Other values (15) | 6817214 |
Common
| Value | Count | Frequency (%) |
| _ | 4053596 | |
| ; | 442055 | 8.7% |
| 8 | 288929 | 5.7% |
| 4 | 288929 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40079553 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 4053596 | |
| E | 3553564 | 8.9% |
| R | 3202871 | 8.0% |
| U | 2974631 | 7.4% |
| I | 2922553 | 7.3% |
| D | 2881308 | 7.2% |
| C | 2865419 | 7.1% |
| N | 2681132 | 6.7% |
| T | 2652470 | 6.6% |
| O | 2377822 | 5.9% |
| Other values (19) | 9914187 |
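Each issue value above is a semicolon-separated list of flags, so the value table counts flag combinations, not individual flags. As a minimal sketch, the per-flag totals could be tallied as follows. The function name `count_issue_flags` is hypothetical, and the input is assumed to be an iterable of raw issue strings in which missing values appear as `None` or an empty string.

```python
from collections import Counter


def count_issue_flags(values):
    """Count how often each individual flag occurs across all issue strings."""
    counts = Counter()
    for value in values:
        if not value:
            # Skip missing entries (None or empty string).
            continue
        # Split the combined string and ignore empty fragments.
        counts.update(flag for flag in value.split(";") if flag)
    return counts


# Sample rows taken from the profile above.
rows = [
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;CONTINENT_INVALID",
]
flag_counts = count_issue_flags(rows)
```

Counting per flag makes it easier to see, for example, how many records carry a given coordinate-related flag regardless of which other flags accompany it.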
mediaType
Text
Missing 
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 579082 |
| Missing (%) | 99.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 285 |
|---|---|
| Median length | 274 |
| Mean length | 32.18480172 |
| Min length | 10 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | StillImage;StillImage;StillImage;StillImage;StillImage |
|---|---|
| 2nd row | StillImage;StillImage |
| 3rd row | StillImage;StillImage;StillImage |
| 4th row | StillImage;StillImage;StillImage |
| 5th row | StillImage;StillImage;StillImage;StillImage |
| Value | Count | Frequency (%) |
| stillimage;stillimage | 2352 | 45.9% |
| stillimage | 841 | 16.4% |
| stillimage;stillimage;stillimage | 690 | 13.5% |
| stillimage;stillimage;stillimage;stillimage | 366 | 7.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 268 | 5.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 188 | 3.7% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 118 | 2.3% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 110 | 2.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 58 | 1.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 41 | 0.8% |
| Other values (13) | 87 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 30886 | |
| S | 15443 | |
| t | 15443 | |
| i | 15443 | |
| I | 15443 | |
| m | 15443 | |
| a | 15443 | |
| g | 15443 | |
| e | 15443 | |
| ; | 10324 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 123544 | |
| Uppercase Letter | 30886 | 18.7% |
| Other Punctuation | 10324 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 30886 | |
| t | 15443 | |
| i | 15443 | |
| m | 15443 | |
| a | 15443 | |
| g | 15443 | |
| e | 15443 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 15443 | |
| I | 15443 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 10324 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 154430 | |
| Common | 10324 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 30886 | |
| S | 15443 | |
| t | 15443 | |
| i | 15443 | |
| I | 15443 | |
| m | 15443 | |
| a | 15443 | |
| g | 15443 | |
| e | 15443 |
Common
| Value | Count | Frequency (%) |
| ; | 10324 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 164754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 30886 | |
| S | 15443 | |
| t | 15443 | |
| i | 15443 | |
| I | 15443 | |
| m | 15443 | |
| a | 15443 | |
| g | 15443 | |
| e | 15443 | |
| ; | 10324 | 6.3% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.278443549 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | true |
| 3rd row | false |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 421534 | 72.2% |
| false | 162667 | 27.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 584201 | |
| t | 421534 | |
| r | 421534 | |
| u | 421534 | |
| f | 162667 | 6.5% |
| a | 162667 | 6.5% |
| l | 162667 | 6.5% |
| s | 162667 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2499471 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 584201 | |
| t | 421534 | |
| r | 421534 | |
| u | 421534 | |
| f | 162667 | 6.5% |
| a | 162667 | 6.5% |
| l | 162667 | 6.5% |
| s | 162667 | 6.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2499471 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 584201 | |
| t | 421534 | |
| r | 421534 | |
| u | 421534 | |
| f | 162667 | 6.5% |
| a | 162667 | 6.5% |
| l | 162667 | 6.5% |
| s | 162667 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2499471 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 584201 | |
| t | 421534 | |
| r | 421534 | |
| u | 421534 | |
| f | 162667 | 6.5% |
| a | 162667 | 6.5% |
| l | 162667 | 6.5% |
| s | 162667 | 6.5% |
hasGeospatialIssues
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.996222191 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 581994 | 99.6% |
| true | 2207 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 581994 | |
| a | 581994 | |
| l | 581994 | |
| s | 581994 | |
| t | 2207 | 0.1% |
| r | 2207 | 0.1% |
| u | 2207 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2918798 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 581994 | |
| a | 581994 | |
| l | 581994 | |
| s | 581994 | |
| t | 2207 | 0.1% |
| r | 2207 | 0.1% |
| u | 2207 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2918798 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 581994 | |
| a | 581994 | |
| l | 581994 | |
| s | 581994 | |
| t | 2207 | 0.1% |
| r | 2207 | 0.1% |
| u | 2207 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2918798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 581994 | |
| a | 581994 | |
| l | 581994 | |
| s | 581994 | |
| t | 2207 | 0.1% |
| r | 2207 | 0.1% |
| u | 2207 | 0.1% |
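The true/false variables above are profiled as Text, so their values arrive as the strings `true` and `false`, not as booleans. A minimal sketch of a conversion step is shown below. The helper name `parse_flag` is hypothetical, and returning `None` for anything unrecognized is a design assumption, not something the report prescribes.

```python
def parse_flag(value):
    """Map the text 'true' or 'false' to a bool; return None for anything else."""
    if value is None:
        return None
    text = value.strip().lower()
    if text == "true":
        return True
    if text == "false":
        return False
    # Unknown text is reported as None rather than guessed.
    return None


# Sample values as they appear in the hasCoordinate rows above.
samples = ["true", "true", "false", "false", "true"]
parsed = [parse_flag(s) for s in samples]
share_true = sum(1 for p in parsed if p) / len(parsed)
```

Mapping unknown text to `None` avoids treating arbitrary non-empty strings as true, which a plain `bool(value)` call would do.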
taxonKey
Text
| Distinct | 9012 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.999032867 |
| Min length | 1 |
Unique
| Unique | 1713 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 5225055 |
|---|---|
| 2nd row | 2431506 |
| 3rd row | 5224383 |
| 4th row | 2446249 |
| 5th row | 2467415 |
| Value | Count | Frequency (%) |
| 2431491 | 75714 | 13.0% |
| 2431539 | 13092 | 2.2% |
| 2431224 | 10137 | 1.7% |
| 2431506 | 9986 | 1.7% |
| 2431529 | 7074 | 1.2% |
| 2431516 | 6940 | 1.2% |
| 2431489 | 6103 | 1.0% |
| 2431484 | 5559 | 1.0% |
| 2431219 | 4681 | 0.8% |
| 2431510 | 4614 | 0.8% |
| Other values (9002) | 440301 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 867830 | |
| 4 | 749774 | |
| 1 | 546778 | |
| 3 | 448678 | |
| 5 | 328014 | 8.0% |
| 9 | 301296 | 7.4% |
| 6 | 239069 | 5.8% |
| 7 | 213079 | 5.2% |
| 8 | 210506 | 5.1% |
| 0 | 183818 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4088842 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 867830 | |
| 4 | 749774 | |
| 1 | 546778 | |
| 3 | 448678 | |
| 5 | 328014 | 8.0% |
| 9 | 301296 | 7.4% |
| 6 | 239069 | 5.8% |
| 7 | 213079 | 5.2% |
| 8 | 210506 | 5.1% |
| 0 | 183818 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4088842 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 867830 | |
| 4 | 749774 | |
| 1 | 546778 | |
| 3 | 448678 | |
| 5 | 328014 | 8.0% |
| 9 | 301296 | 7.4% |
| 6 | 239069 | 5.8% |
| 7 | 213079 | 5.2% |
| 8 | 210506 | 5.1% |
| 0 | 183818 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4088842 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 867830 | |
| 4 | 749774 | |
| 1 | 546778 | |
| 3 | 448678 | |
| 5 | 328014 | 8.0% |
| 9 | 301296 | 7.4% |
| 6 | 239069 | 5.8% |
| 7 | 213079 | 5.2% |
| 8 | 210506 | 5.1% |
| 0 | 183818 | 4.5% |
acceptedTaxonKey
Text
| Distinct | 8475 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.019563472 |
| Min length | 1 |
Unique
| Unique | 1520 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 5225055 |
|---|---|
| 2nd row | 2431506 |
| 3rd row | 5224383 |
| 4th row | 2446249 |
| 5th row | 2467415 |
| Value | Count | Frequency (%) |
| 2431491 | 75714 | 13.0% |
| 2431539 | 13092 | 2.2% |
| 2431224 | 10146 | 1.7% |
| 2431506 | 9986 | 1.7% |
| 2431516 | 8012 | 1.4% |
| 2431529 | 7074 | 1.2% |
| 2431489 | 6103 | 1.0% |
| 2431484 | 5929 | 1.0% |
| 2431219 | 4681 | 0.8% |
| 2431510 | 4614 | 0.8% |
| Other values (8465) | 438850 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4100836 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4100836 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4100836 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 850954 | |
| 4 | 741903 | |
| 1 | 558280 | |
| 3 | 457912 | |
| 5 | 330769 | 8.1% |
| 9 | 302439 | 7.4% |
| 6 | 235990 | 5.8% |
| 8 | 216124 | 5.3% |
| 7 | 207914 | 5.1% |
| 0 | 198551 | 4.8% |
kingdomKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 584201 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 584201 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 584201 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 584201 |
phylumKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 584196 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 1168392 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1168392 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1168392 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1168392 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 1168392 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1168392 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 1168392 |
classKey
Text
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 203 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 4.616760674 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 11592253 |
|---|---|
| 2nd row | 131 |
| 3rd row | 11592253 |
| 4th row | 11592253 |
| 5th row | 11592253 |
| Value | Count | Frequency (%) |
| 131 | 395161 | |
| 11592253 | 169110 | |
| 11418114 | 18909 | 3.2% |
| 11493978 | 804 | 0.1% |
| 11569602 | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1224723 | |
| 3 | 565075 | |
| 5 | 338234 | 12.5% |
| 2 | 338234 | 12.5% |
| 9 | 170732 | 6.3% |
| 4 | 38622 | 1.4% |
| 8 | 19713 | 0.7% |
| 7 | 804 | < 0.1% |
| 6 | 28 | < 0.1% |
| 0 | 14 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2696179 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1224723 | |
| 3 | 565075 | |
| 5 | 338234 | 12.5% |
| 2 | 338234 | 12.5% |
| 9 | 170732 | 6.3% |
| 4 | 38622 | 1.4% |
| 8 | 19713 | 0.7% |
| 7 | 804 | < 0.1% |
| 6 | 28 | < 0.1% |
| 0 | 14 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2696179 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1224723 | |
| 3 | 565075 | |
| 5 | 338234 | 12.5% |
| 2 | 338234 | 12.5% |
| 9 | 170732 | 6.3% |
| 4 | 38622 | 1.4% |
| 8 | 19713 | 0.7% |
| 7 | 804 | < 0.1% |
| 6 | 28 | < 0.1% |
| 0 | 14 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2696179 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1224723 | |
| 3 | 565075 | |
| 5 | 338234 | 12.5% |
| 2 | 338234 | 12.5% |
| 9 | 170732 | 6.3% |
| 4 | 38622 | 1.4% |
| 8 | 19713 | 0.7% |
| 7 | 804 | < 0.1% |
| 6 | 28 | < 0.1% |
| 0 | 14 | < 0.1% |
orderKey
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 189040 |
| Missing (%) | 32.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 953 |
|---|---|
| 2nd row | 953 |
| 3rd row | 952 |
| 4th row | 952 |
| 5th row | 953 |
| Value | Count | Frequency (%) |
| 953 | 237129 | |
| 952 | 157511 | |
| 775 | 521 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 395161 | |
| 9 | 394640 | |
| 3 | 237129 | |
| 2 | 157511 | 13.3% |
| 7 | 1042 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1185483 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 395161 | |
| 9 | 394640 | |
| 3 | 237129 | |
| 2 | 157511 | 13.3% |
| 7 | 1042 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1185483 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 395161 | |
| 9 | 394640 | |
| 3 | 237129 | |
| 2 | 157511 | 13.3% |
| 7 | 1042 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1185483 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 395161 | |
| 9 | 394640 | |
| 3 | 237129 | |
| 2 | 157511 | 13.3% |
| 7 | 1042 | 0.1% |
familyKey
Text
| Distinct | 159 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 587 |
| Missing (%) | 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.194695467 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 9115 |
|---|---|
| 2nd row | 6748 |
| 3rd row | 5789856 |
| 4th row | 5666 |
| 5th row | 8345926 |
| Value | Count | Frequency (%) |
| 6748 | 221371 | |
| 6735 | 41566 | 7.1% |
| 6172 | 38793 | 6.6% |
| 9115 | 26153 | 4.5% |
| 6727 | 25125 | 4.3% |
| 6746 | 20333 | 3.5% |
| 8345926 | 18373 | 3.1% |
| 5666 | 17255 | 3.0% |
| 5016 | 16259 | 2.8% |
| 6739 | 10435 | 1.8% |
| Other values (149) | 147951 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 544722 | |
| 7 | 457502 | |
| 4 | 315203 | |
| 8 | 289994 | |
| 5 | 197308 | 8.1% |
| 1 | 162565 | 6.6% |
| 3 | 151911 | 6.2% |
| 2 | 123166 | 5.0% |
| 9 | 117217 | 4.8% |
| 0 | 88495 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2448083 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 544722 | |
| 7 | 457502 | |
| 4 | 315203 | |
| 8 | 289994 | |
| 5 | 197308 | 8.1% |
| 1 | 162565 | 6.6% |
| 3 | 151911 | 6.2% |
| 2 | 123166 | 5.0% |
| 9 | 117217 | 4.8% |
| 0 | 88495 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2448083 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 544722 | |
| 7 | 457502 | |
| 4 | 315203 | |
| 8 | 289994 | |
| 5 | 197308 | 8.1% |
| 1 | 162565 | 6.6% |
| 3 | 151911 | 6.2% |
| 2 | 123166 | 5.0% |
| 9 | 117217 | 4.8% |
| 0 | 88495 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2448083 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 544722 | |
| 7 | 457502 | |
| 4 | 315203 | |
| 8 | 289994 | |
| 5 | 197308 | 8.1% |
| 1 | 162565 | 6.6% |
| 3 | 151911 | 6.2% |
| 2 | 123166 | 5.0% |
| 9 | 117217 | 4.8% |
| 0 | 88495 | 3.6% |
genusKey
Text
| Distinct | 1418 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1685 |
| Missing (%) | 0.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.007369755 |
| Min length | 7 |
Unique
| Unique | 98 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2461082 |
|---|---|
| 2nd row | 2431477 |
| 3rd row | 2449677 |
| 4th row | 2446193 |
| 5th row | 8782549 |
| Value | Count | Frequency (%) |
| 2431477 | 168423 | |
| 2431198 | 35846 | 6.2% |
| 8782549 | 18373 | 3.2% |
| 2427046 | 12991 | 2.2% |
| 2424035 | 9948 | 1.7% |
| 2422857 | 9476 | 1.6% |
| 2451143 | 8824 | 1.5% |
| 2463307 | 8233 | 1.4% |
| 5218343 | 7667 | 1.3% |
| 2428124 | 6800 | 1.2% |
| Other values (1408) | 295935 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 924908 | |
| 2 | 838523 | |
| 7 | 519504 | |
| 3 | 429420 | |
| 1 | 409884 | |
| 8 | 234519 | 5.7% |
| 5 | 214552 | 5.3% |
| 9 | 184654 | 4.5% |
| 6 | 179151 | 4.4% |
| 0 | 146790 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4081905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 924908 | |
| 2 | 838523 | |
| 7 | 519504 | |
| 3 | 429420 | |
| 1 | 409884 | |
| 8 | 234519 | 5.7% |
| 5 | 214552 | 5.3% |
| 9 | 184654 | 4.5% |
| 6 | 179151 | 4.4% |
| 0 | 146790 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4081905 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 924908 | |
| 2 | 838523 | |
| 7 | 519504 | |
| 3 | 429420 | |
| 1 | 409884 | |
| 8 | 234519 | 5.7% |
| 5 | 214552 | 5.3% |
| 9 | 184654 | 4.5% |
| 6 | 179151 | 4.4% |
| 0 | 146790 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4081905 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 924908 | |
| 2 | 838523 | |
| 7 | 519504 | |
| 3 | 429420 | |
| 1 | 409884 | |
| 8 | 234519 | 5.7% |
| 5 | 214552 | 5.3% |
| 9 | 184654 | 4.5% |
| 6 | 179151 | 4.4% |
| 0 | 146790 | 3.6% |
speciesKey
Text
Missing 
| Distinct | 7286 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 15011 |
| Missing (%) | 2.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.029765105 |
| Min length | 7 |
Unique
| Unique | 1233 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 5225055 |
|---|---|
| 2nd row | 2431506 |
| 3rd row | 5224383 |
| 4th row | 2446249 |
| 5th row | 2467415 |
| Value | Count | Frequency (%) |
| 2431491 | 75714 | 13.3% |
| 2431539 | 13092 | 2.3% |
| 2431224 | 10146 | 1.8% |
| 2431506 | 9986 | 1.8% |
| 2431516 | 8012 | 1.4% |
| 2431529 | 7074 | 1.2% |
| 2431489 | 6103 | 1.1% |
| 2431484 | 5929 | 1.0% |
| 2431219 | 4681 | 0.8% |
| 2431510 | 4614 | 0.8% |
| Other values (7276) | 423839 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 839953 | |
| 4 | 732856 | |
| 1 | 542210 | |
| 3 | 452620 | |
| 5 | 325394 | 8.1% |
| 9 | 290547 | 7.3% |
| 6 | 220240 | 5.5% |
| 8 | 210077 | 5.3% |
| 0 | 193852 | 4.8% |
| 7 | 193523 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4001272 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 839953 | |
| 4 | 732856 | |
| 1 | 542210 | |
| 3 | 452620 | |
| 5 | 325394 | 8.1% |
| 9 | 290547 | 7.3% |
| 6 | 220240 | 5.5% |
| 8 | 210077 | 5.3% |
| 0 | 193852 | 4.8% |
| 7 | 193523 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4001272 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 839953 | |
| 4 | 732856 | |
| 1 | 542210 | |
| 3 | 452620 | |
| 5 | 325394 | 8.1% |
| 9 | 290547 | 7.3% |
| 6 | 220240 | 5.5% |
| 8 | 210077 | 5.3% |
| 0 | 193852 | 4.8% |
| 7 | 193523 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4001272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 839953 | |
| 4 | 732856 | |
| 1 | 542210 | |
| 3 | 452620 | |
| 5 | 325394 | 8.1% |
| 9 | 290547 | 7.3% |
| 6 | 220240 | 5.5% |
| 8 | 210077 | 5.3% |
| 0 | 193852 | 4.8% |
| 7 | 193523 | 4.8% |
species
Text
Missing 
| Distinct | 7285 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 15011 |
| Missing (%) | 2.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 31 |
| Mean length | 19.39421459 |
| Min length | 9 |
Unique
| Unique | 1233 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Carlia bicarinata |
|---|---|
| 2nd row | Plethodon montanus |
| 3rd row | Enhydris enhydris |
| 4th row | Gehyra mutilata |
| 5th row | Anolis richardii |
| Value | Count | Frequency (%) |
| plethodon | 165820 | 14.6% |
| cinereus | 77201 | 6.8% |
| desmognathus | 34836 | 3.1% |
| anolis | 18232 | 1.6% |
| glutinosus | 13098 | 1.2% |
| lithobates | 12881 | 1.1% |
| fuscus | 10914 | 1.0% |
| montanus | 10391 | 0.9% |
| eleutherodactylus | 9909 | 0.9% |
| anaxyrus | 9456 | 0.8% |
| Other values (6398) | 775642 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 936730 | 8.5% |
| o | 931837 | 8.4% |
| s | 908046 | 8.2% |
| a | 828057 | 7.5% |
| i | 769871 | 7.0% |
| n | 699609 | 6.3% |
| t | 680178 | 6.2% |
| l | 614875 | 5.6% |
| r | 614619 | 5.6% |
| u | 607807 | 5.5% |
| Other values (44) | 3447364 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9900052 | |
| Space Separator | 569190 | 5.2% |
| Uppercase Letter | 569190 | 5.2% |
| Dash Punctuation | 561 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 936730 | |
| o | 931837 | |
| s | 908046 | 9.2% |
| a | 828057 | 8.4% |
| i | 769871 | 7.8% |
| n | 699609 | 7.1% |
| t | 680178 | 6.9% |
| l | 614875 | 6.2% |
| r | 614619 | 6.2% |
| u | 607807 | 6.1% |
| Other values (16) | 2308423 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 206462 | |
| A | 55946 | 9.8% |
| D | 52070 | 9.1% |
| L | 36458 | 6.4% |
| S | 33536 | 5.9% |
| E | 33482 | 5.9% |
| C | 30807 | 5.4% |
| H | 18383 | 3.2% |
| T | 17582 | 3.1% |
| B | 13484 | 2.4% |
| Other values (16) | 70980 | 12.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 569190 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10469242 | |
| Common | 569751 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 936730 | 8.9% |
| o | 931837 | 8.9% |
| s | 908046 | 8.7% |
| a | 828057 | 7.9% |
| i | 769871 | 7.4% |
| n | 699609 | 6.7% |
| t | 680178 | 6.5% |
| l | 614875 | 5.9% |
| r | 614619 | 5.9% |
| u | 607807 | 5.8% |
| Other values (42) | 2877613 |
Common
| Value | Count | Frequency (%) |
| (space) | 569190 | |
| - | 561 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11038993 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 936730 | 8.5% |
| o | 931837 | 8.4% |
| s | 908046 | 8.2% |
| a | 828057 | 7.5% |
| i | 769871 | 7.0% |
| n | 699609 | 6.3% |
| t | 680178 | 6.2% |
| l | 614875 | 5.6% |
| r | 614619 | 5.6% |
| u | 607807 | 5.5% |
| Other values (44) | 3447364 |
| Distinct | 8475 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 182 |
|---|---|
| Median length | 112 |
| Mean length | 35.64721046 |
| Min length | 5 |
Unique
| Unique | 1520 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Carlia bicarinata (Macleay, 1877) |
|---|---|
| 2nd row | Plethodon montanus Highton & Peabody, 2000 |
| 3rd row | Enhydris enhydris (Schneider, 1799) |
| 4th row | Gehyra mutilata (Wiegmann, 1834) |
| 5th row | Anolis richardii Duméril & Bibron, 1837 |
| Value | Count | Frequency (%) |
| plethodon | 168423 | 6.7% |
| green | 95577 | 3.8% |
| 1818 | 93564 | 3.7% |
| & | 82003 | 3.3% |
| cinereus | 77201 | 3.1% |
| desmognathus | 35846 | 1.4% |
| cope | 33460 | 1.3% |
| duméril | 27066 | 1.1% |
| linnaeus | 26934 | 1.1% |
| bibron | 23993 | 1.0% |
| Other values (8473) | 1852441 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1932307 | 9.3% |
| e | 1568353 | 7.5% |
| o | 1194544 | 5.7% |
| n | 1166456 | 5.6% |
| a | 1137420 | 5.5% |
| i | 1101341 | 5.3% |
| s | 1056757 | 5.1% |
| r | 1050847 | 5.0% |
| t | 855218 | 4.1% |
| l | 825579 | 4.0% |
| Other values (78) | 8936314 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13876417 | |
| Decimal Number | 2316084 | 11.1% |
| Space Separator | 1932307 | 9.3% |
| Uppercase Letter | 1283545 | 6.2% |
| Other Punctuation | 673329 | 3.2% |
| Open Punctuation | 368234 | 1.8% |
| Close Punctuation | 368234 | 1.8% |
| Dash Punctuation | 6986 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1568353 | |
| o | 1194544 | 8.6% |
| n | 1166456 | 8.4% |
| a | 1137420 | 8.2% |
| i | 1101341 | 7.9% |
| s | 1056757 | 7.6% |
| r | 1050847 | 7.6% |
| t | 855218 | 6.2% |
| l | 825579 | 5.9% |
| u | 775518 | 5.6% |
| Other values (32) | 3144384 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 235206 | |
| G | 164279 | |
| D | 115545 | |
| B | 113754 | |
| L | 96736 | |
| S | 85411 | 6.7% |
| C | 82764 | 6.4% |
| H | 78166 | 6.1% |
| A | 63159 | 4.9% |
| E | 37002 | 2.9% |
| Other values (18) | 211523 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 726593 | |
| 8 | 564818 | |
| 9 | 223745 | 9.7% |
| 2 | 150515 | 6.5% |
| 0 | 136547 | 5.9% |
| 5 | 119543 | 5.2% |
| 7 | 114778 | 5.0% |
| 6 | 102349 | 4.4% |
| 3 | 97140 | 4.2% |
| 4 | 80056 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 590016 | |
| & | 82003 | 12.2% |
| ' | 797 | 0.1% |
| . | 513 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1932307 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 368234 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 368234 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15159962 | |
| Common | 5665174 | 27.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1568353 | 10.3% |
| o | 1194544 | 7.9% |
| n | 1166456 | 7.7% |
| a | 1137420 | 7.5% |
| i | 1101341 | 7.3% |
| s | 1056757 | 7.0% |
| r | 1050847 | 6.9% |
| t | 855218 | 5.6% |
| l | 825579 | 5.4% |
| u | 775518 | 5.1% |
| Other values (60) | 4427929 |
Common
| Value | Count | Frequency (%) |
| (space) | 1932307 | |
| 1 | 726593 | 12.8% |
| , | 590016 | 10.4% |
| 8 | 564818 | 10.0% |
| ( | 368234 | 6.5% |
| ) | 368234 | 6.5% |
| 9 | 223745 | 3.9% |
| 2 | 150515 | 2.7% |
| 0 | 136547 | 2.4% |
| 5 | 119543 | 2.1% |
| Other values (8) | 484622 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20779927 | |
| None | 45209 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1932307 | 9.3% |
| e | 1568353 | 7.5% |
| o | 1194544 | 5.7% |
| n | 1166456 | 5.6% |
| a | 1137420 | 5.5% |
| i | 1101341 | 5.3% |
| s | 1056757 | 5.1% |
| r | 1050847 | 5.1% |
| t | 855218 | 4.1% |
| l | 825579 | 4.0% |
| Other values (60) | 8891105 |
None
| Value | Count | Frequency (%) |
| é | 30067 | |
| ü | 10808 | 23.9% |
| è | 1828 | 4.0% |
| ö | 1268 | 2.8% |
| í | 442 | 1.0% |
| Ö | 294 | 0.7% |
| ñ | 248 | 0.5% |
| á | 137 | 0.3% |
| ó | 64 | 0.1% |
| å | 20 | < 0.1% |
| Other values (8) | 33 | 0.1% |
| Distinct | 9530 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 56 |
| Mean length | 19.84556343 |
| Min length | 4 |
Unique
| Unique | 1890 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Carlia bicarinata |
|---|---|
| 2nd row | Plethodon montanus |
| 3rd row | Enhydris enhydris |
| 4th row | Gehyra mutilata |
| 5th row | Anolis richardii |
| Value | Count | Frequency (%) |
| plethodon | 168423 | 14.0% |
| cinereus | 75774 | 6.3% |
| desmognathus | 35846 | 3.0% |
| anolis | 18352 | 1.5% |
| glutinosus | 13372 | 1.1% |
| lithobates | 12991 | 1.1% |
| fuscus | 11321 | 0.9% |
| montanus | 10417 | 0.9% |
| eleutherodactylus | 9959 | 0.8% |
| anaxyrus | 9474 | 0.8% |
| Other values (7195) | 837184 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 976778 | 8.4% |
| o | 954528 | 8.2% |
| s | 947788 | 8.2% |
| a | 896948 | 7.7% |
| i | 821046 | 7.1% |
| n | 729826 | 6.3% |
| t | 711935 | 6.1% |
| l | 642687 | 5.5% |
| u | 635392 | 5.5% |
| r | 633897 | 5.5% |
| Other values (49) | 3642973 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10381385 | |
| Space Separator | 618912 | 5.3% |
| Uppercase Letter | 582830 | 5.0% |
| Other Punctuation | 10078 | 0.1% |
| Dash Punctuation | 561 | < 0.1% |
| Open Punctuation | 16 | < 0.1% |
| Close Punctuation | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 976778 | |
| o | 954528 | 9.2% |
| s | 947788 | 9.1% |
| a | 896948 | 8.6% |
| i | 821046 | 7.9% |
| n | 729826 | 7.0% |
| t | 711935 | 6.9% |
| l | 642687 | 6.2% |
| u | 635392 | 6.1% |
| r | 633897 | 6.1% |
| Other values (16) | 2430560 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 210261 | |
| A | 59585 | 10.2% |
| D | 48691 | 8.4% |
| L | 39048 | 6.7% |
| E | 33682 | 5.8% |
| S | 33117 | 5.7% |
| C | 32240 | 5.5% |
| H | 26213 | 4.5% |
| T | 17139 | 2.9% |
| R | 13689 | 2.3% |
| Other values (15) | 69165 | 11.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 8496 | |
| . | 1545 | 15.3% |
| / | 21 | 0.2% |
| ? | 16 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 618912 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 561 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10964215 | |
| Common | 629583 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 976778 | 8.9% |
| o | 954528 | 8.7% |
| s | 947788 | 8.6% |
| a | 896948 | 8.2% |
| i | 821046 | 7.5% |
| n | 729826 | 6.7% |
| t | 711935 | 6.5% |
| l | 642687 | 5.9% |
| u | 635392 | 5.8% |
| r | 633897 | 5.8% |
| Other values (41) | 3013390 |
Common
| Value | Count | Frequency (%) |
| (space) | 618912 | |
| " | 8496 | 1.3% |
| . | 1545 | 0.2% |
| - | 561 | 0.1% |
| / | 21 | < 0.1% |
| ( | 16 | < 0.1% |
| ? | 16 | < 0.1% |
| ) | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11593798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 976778 | 8.4% |
| o | 954528 | 8.2% |
| s | 947788 | 8.2% |
| a | 896948 | 7.7% |
| i | 821046 | 7.1% |
| n | 729826 | 6.3% |
| t | 711935 | 6.1% |
| l | 642687 | 5.5% |
| u | 635392 | 5.5% |
| r | 633897 | 5.5% |
| Other values (49) | 3642973 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 584201 | |
| M | 584201 | |
| L | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1752603 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 584201 | |
| M | 584201 | |
| L | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1752603 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 584201 | |
| M | 584201 | |
| L | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1752603 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 584201 | |
| M | 584201 | |
| L | 584201 |
lastParsed
Text
| Distinct | 186736 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99567957 |
| Min length | 20 |
Unique
| Unique | 40383 |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 2024-12-02T13:56:06.739Z |
|---|---|
| 2nd row | 2024-12-02T13:56:08.224Z |
| 3rd row | 2024-12-02T13:55:56.801Z |
| 4th row | 2024-12-02T13:59:51.499Z |
| 5th row | 2024-12-02T13:58:04.592Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:45.601z | 17 | < 0.1% |
| 2024-12-02t13:57:52.847z | 16 | < 0.1% |
| 2024-12-02t13:57:54.221z | 16 | < 0.1% |
| 2024-12-02t13:57:23.249z | 16 | < 0.1% |
| 2024-12-02t13:57:51.135z | 16 | < 0.1% |
| 2024-12-02t13:57:50.745z | 15 | < 0.1% |
| 2024-12-02t13:58:01.663z | 15 | < 0.1% |
| 2024-12-02t13:56:52.538z | 15 | < 0.1% |
| 2024-12-02t13:57:30.398z | 15 | < 0.1% |
| 2024-12-02t13:57:53.169z | 15 | < 0.1% |
| Other values (186726) | 584045 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| - | 1168402 | |
| : | 1168402 | |
| 4 | 939301 | 6.7% |
| 5 | 927875 | 6.6% |
| 3 | 926225 | 6.6% |
| T | 584201 | 4.2% |
| Z | 584201 | 4.2% |
| Other values (5) | 2098000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9929524 | |
| Other Punctuation | 1751972 | 12.5% |
| Dash Punctuation | 1168402 | 8.3% |
| Uppercase Letter | 1168402 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| 4 | 939301 | 9.5% |
| 5 | 927875 | 9.3% |
| 3 | 926225 | 9.3% |
| 7 | 448284 | 4.5% |
| 9 | 373157 | 3.8% |
| 6 | 352898 | 3.6% |
| 8 | 340091 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 | |
| . | 583570 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12849898 | |
| Latin | 1168402 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| - | 1168402 | |
| : | 1168402 | |
| 4 | 939301 | 7.3% |
| 5 | 927875 | 7.2% |
| 3 | 926225 | 7.2% |
| . | 583570 | 4.5% |
| 7 | 448284 | 3.5% |
| Other values (3) | 1066146 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14018300 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2668002 | |
| 0 | 1480784 | |
| 1 | 1472907 | |
| - | 1168402 | |
| : | 1168402 | |
| 4 | 939301 | 6.7% |
| 5 | 927875 | 6.6% |
| 3 | 926225 | 6.6% |
| T | 584201 | 4.2% |
| Z | 584201 | 4.2% |
| Other values (5) | 2098000 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2921005 | |
| 1 | 2336804 | |
| 4 | 1752603 | |
| 0 | 1168402 | 8.3% |
| - | 1168402 | 8.3% |
| : | 1168402 | 8.3% |
| T | 584201 | 4.2% |
| 8 | 584201 | 4.2% |
| 3 | 584201 | 4.2% |
| . | 584201 | 4.2% |
| Other values (2) | 1168402 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9931417 | |
| Other Punctuation | 1752603 | 12.5% |
| Dash Punctuation | 1168402 | 8.3% |
| Uppercase Letter | 1168402 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2921005 | |
| 1 | 2336804 | |
| 4 | 1752603 | |
| 0 | 1168402 | 11.8% |
| 8 | 584201 | 5.9% |
| 3 | 584201 | 5.9% |
| 6 | 584201 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 | |
| . | 584201 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12852422 | |
| Latin | 1168402 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2921005 | |
| 1 | 2336804 | |
| 4 | 1752603 | |
| 0 | 1168402 | 9.1% |
| - | 1168402 | 9.1% |
| : | 1168402 | 9.1% |
| 8 | 584201 | 4.5% |
| 3 | 584201 | 4.5% |
| . | 584201 | 4.5% |
| 6 | 584201 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 584201 | |
| Z | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14020824 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2921005 | |
| 1 | 2336804 | |
| 4 | 1752603 | |
| 0 | 1168402 | 8.3% |
| - | 1168402 | 8.3% |
| : | 1168402 | 8.3% |
| T | 584201 | 4.2% |
| 8 | 584201 | 4.2% |
| 3 | 584201 | 4.2% |
| . | 584201 | 4.2% |
| Other values (2) | 1168402 | 8.3% |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10596 |
| Missing (%) | 1.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.582658798 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | true |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 334216 | |
| true | 239389 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 573605 | |
| f | 334216 | |
| a | 334216 | |
| l | 334216 | |
| s | 334216 | |
| t | 239389 | |
| r | 239389 | |
| u | 239389 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2628636 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 573605 | |
| f | 334216 | |
| a | 334216 | |
| l | 334216 | |
| s | 334216 | |
| t | 239389 | |
| r | 239389 | |
| u | 239389 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2628636 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 573605 | |
| f | 334216 | |
| a | 334216 | |
| l | 334216 | |
| s | 334216 | |
| t | 239389 | |
| r | 239389 | |
| u | 239389 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2628636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 573605 | |
| f | 334216 | |
| a | 334216 | |
| l | 334216 | |
| s | 334216 | |
| t | 239389 | |
| r | 239389 | |
| u | 239389 |
isSequenced
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.998765836 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 583480 | |
| true | 721 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 583480 | |
| a | 583480 | |
| l | 583480 | |
| s | 583480 | |
| t | 721 | < 0.1% |
| r | 721 | < 0.1% |
| u | 721 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2920284 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 583480 | |
| a | 583480 | |
| l | 583480 | |
| s | 583480 | |
| t | 721 | < 0.1% |
| r | 721 | < 0.1% |
| u | 721 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2920284 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 583480 | |
| a | 583480 | |
| l | 583480 | |
| s | 583480 | |
| t | 721 | < 0.1% |
| r | 721 | < 0.1% |
| u | 721 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2920284 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 584201 | |
| f | 583480 | |
| a | 583480 | |
| l | 583480 | |
| s | 583480 | |
| t | 721 | < 0.1% |
| r | 721 | < 0.1% |
| u | 721 | < 0.1% |
gbifRegion
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11409 |
| Missing (%) | 2.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.80906158 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OCEANIA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | OCEANIA |
| 4th row | LATIN_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 335375 | |
| latin_america | 147208 | |
| asia | 39442 | 6.9% |
| oceania | 28187 | 4.9% |
| africa | 19937 | 3.5% |
| europe | 2643 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1287506 | |
| R | 840538 | |
| I | 717357 | |
| C | 530707 | |
| E | 516056 | |
| N | 510770 | 7.6% |
| T | 482583 | 7.1% |
| _ | 482583 | 7.1% |
| M | 482583 | 7.1% |
| O | 366205 | 5.4% |
| Other values (6) | 547248 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6281553 | |
| Connector Punctuation | 482583 | 7.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1287506 | |
| R | 840538 | |
| I | 717357 | |
| C | 530707 | |
| E | 516056 | |
| N | 510770 | 8.1% |
| T | 482583 | 7.7% |
| M | 482583 | 7.7% |
| O | 366205 | 5.8% |
| H | 335375 | 5.3% |
| Other values (5) | 211873 | 3.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 482583 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6281553 | |
| Common | 482583 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1287506 | |
| R | 840538 | |
| I | 717357 | |
| C | 530707 | |
| E | 516056 | |
| N | 510770 | 8.1% |
| T | 482583 | 7.7% |
| M | 482583 | 7.7% |
| O | 366205 | 5.8% |
| H | 335375 | 5.3% |
| Other values (5) | 211873 | 3.4% |
Common
| Value | Count | Frequency (%) |
| _ | 482583 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6764136 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1287506 | |
| R | 840538 | |
| I | 717357 | |
| C | 530707 | |
| E | 516056 | |
| N | 510770 | 7.6% |
| T | 482583 | 7.1% |
| _ | 482583 | 7.1% |
| M | 482583 | 7.1% |
| O | 366205 | 5.4% |
| Other values (6) | 547248 |
publishedByGbifRegion
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1168402 | |
| A | 1168402 | |
| N | 584201 | |
| O | 584201 | |
| T | 584201 | |
| H | 584201 | |
| _ | 584201 | |
| M | 584201 | |
| E | 584201 | |
| I | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7010412 | |
| Connector Punctuation | 584201 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1168402 | |
| A | 1168402 | |
| N | 584201 | |
| O | 584201 | |
| T | 584201 | |
| H | 584201 | |
| M | 584201 | |
| E | 584201 | |
| I | 584201 | |
| C | 584201 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7010412 | |
| Common | 584201 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1168402 | |
| A | 1168402 | |
| N | 584201 | |
| O | 584201 | |
| T | 584201 | |
| H | 584201 | |
| M | 584201 | |
| E | 584201 | |
| I | 584201 | |
| C | 584201 |
Common
| Value | Count | Frequency (%) |
| _ | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7594613 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1168402 | |
| A | 1168402 | |
| N | 584201 | |
| O | 584201 | |
| T | 584201 | |
| H | 584201 | |
| _ | 584201 | |
| M | 584201 | |
| E | 584201 | |
| I | 584201 |
level0Gid
Text
Missing 
| Distinct | 175 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 173676 |
| Missing (%) | 29.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 14 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PNG |
|---|---|
| 2nd row | USA |
| 3rd row | GRD |
| 4th row | USA |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 282827 | |
| ecu | 14871 | 3.6% |
| bra | 13519 | 3.3% |
| per | 12508 | 3.0% |
| hnd | 10032 | 2.4% |
| mex | 4961 | 1.2% |
| dom | 4618 | 1.1% |
| cub | 3855 | 0.9% |
| png | 3606 | 0.9% |
| hti | 3483 | 0.8% |
| Other values (165) | 56245 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 308853 | |
| A | 306743 | |
| S | 286617 | |
| R | 40442 | 3.3% |
| E | 40339 | 3.3% |
| P | 28015 | 2.3% |
| N | 27326 | 2.2% |
| C | 26870 | 2.2% |
| M | 25313 | 2.1% |
| B | 21622 | 1.8% |
| Other values (18) | 119435 | 9.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1231567 | |
| Decimal Number | 8 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 308853 | |
| A | 306743 | |
| S | 286617 | |
| R | 40442 | 3.3% |
| E | 40339 | 3.3% |
| P | 28015 | 2.3% |
| N | 27326 | 2.2% |
| C | 26870 | 2.2% |
| M | 25313 | 2.1% |
| B | 21622 | 1.8% |
| Other values (16) | 119427 | 9.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 6 | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1231567 | |
| Common | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 308853 | |
| A | 306743 | |
| S | 286617 | |
| R | 40442 | 3.3% |
| E | 40339 | 3.3% |
| P | 28015 | 2.3% |
| N | 27326 | 2.2% |
| C | 26870 | 2.2% |
| M | 25313 | 2.1% |
| B | 21622 | 1.8% |
| Other values (16) | 119427 | 9.7% |
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 6 | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1231575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 308853 | |
| A | 306743 | |
| S | 286617 | |
| R | 40442 | 3.3% |
| E | 40339 | 3.3% |
| P | 28015 | 2.3% |
| N | 27326 | 2.2% |
| C | 26870 | 2.2% |
| M | 25313 | 2.1% |
| B | 21622 | 1.8% |
| Other values (18) | 119435 | 9.7% |
level0Name
Text
Missing 
| Distinct | 175 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 173676 |
| Missing (%) | 29.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 11.47883564 |
| Min length | 4 |
Unique
| Unique | 14 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Papua New Guinea |
|---|---|
| 2nd row | United States |
| 3rd row | Grenada |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 283460 | |
| states | 283455 | |
| ecuador | 14871 | 2.0% |
| brazil | 13519 | 1.9% |
| peru | 12508 | 1.7% |
| honduras | 10032 | 1.4% |
| republic | 5833 | 0.8% |
| méxico | 4961 | 0.7% |
| dominican | 4618 | 0.6% |
| guinea | 3935 | 0.5% |
| Other values (203) | 92017 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 869782 | |
| e | 617316 | |
| a | 436923 | |
| i | 361412 | |
| n | 346814 | 7.4% |
| d | 321822 | 6.8% |
| (space) | 318684 | 6.8% |
| s | 308278 | 6.5% |
| S | 286866 | 6.1% |
| U | 283929 | 6.0% |
| Other values (48) | 560523 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3669150 | |
| Uppercase Letter | 724199 | 15.4% |
| Space Separator | 318684 | 6.8% |
| Other Punctuation | 280 | < 0.1% |
| Dash Punctuation | 36 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 869782 | |
| e | 617316 | |
| a | 436923 | |
| i | 361412 | |
| n | 346814 | 9.5% |
| d | 321822 | 8.8% |
| s | 308278 | 8.4% |
| r | 74760 | 2.0% |
| u | 73108 | 2.0% |
| o | 63163 | 1.7% |
| Other values (20) | 195772 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 286866 | |
| U | 283929 | |
| P | 24491 | 3.4% |
| E | 17791 | 2.5% |
| B | 16468 | 2.3% |
| H | 13529 | 1.9% |
| M | 12279 | 1.7% |
| C | 12037 | 1.7% |
| R | 10730 | 1.5% |
| G | 10373 | 1.4% |
| Other values (13) | 35706 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 116 | |
| . | 104 | |
| ' | 60 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 318684 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4393349 | |
| Common | 319000 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 869782 | |
| e | 617316 | |
| a | 436923 | |
| i | 361412 | |
| n | 346814 | 7.9% |
| d | 321822 | 7.3% |
| s | 308278 | 7.0% |
| S | 286866 | 6.5% |
| U | 283929 | 6.5% |
| r | 74760 | 1.7% |
| Other values (43) | 485447 |
Common
| Value | Count | Frequency (%) |
| (space) | 318684 | |
| , | 116 | < 0.1% |
| . | 104 | < 0.1% |
| ' | 60 | < 0.1% |
| - | 36 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4704801 | |
| None | 7548 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 869782 | |
| e | 617316 | |
| a | 436923 | |
| i | 361412 | |
| n | 346814 | 7.4% |
| d | 321822 | 6.8% |
| (space) | 318684 | 6.8% |
| s | 308278 | 6.6% |
| S | 286866 | 6.1% |
| U | 283929 | 6.0% |
| Other values (44) | 552975 |
None
| Value | Count | Frequency (%) |
| é | 5812 | |
| ã | 838 | 11.1% |
| í | 838 | 11.1% |
| ô | 60 | 0.8% |
level1Gid
Text
Missing 
| Distinct | 1192 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 174349 |
| Missing (%) | 29.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.801640592 |
| Min length | 6 |
Unique
| Unique | 127 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PNG.2_1 |
|---|---|
| 2nd row | USA.34_1 |
| 3rd row | GRD.4_1 |
| 4th row | USA.47_1 |
| 5th row | USA.29_1 |
| Value | Count | Frequency (%) |
| usa.47_1 | 68346 | 16.7% |
| usa.34_1 | 51010 | 12.4% |
| usa.21_1 | 30907 | 7.5% |
| usa.39_1 | 18483 | 4.5% |
| usa.49_1 | 17227 | 4.2% |
| usa.43_1 | 10691 | 2.6% |
| usa.5_1 | 8858 | 2.2% |
| usa.11_1 | 8403 | 2.1% |
| usa.10_1 | 7784 | 1.9% |
| usa.37_1 | 5139 | 1.3% |
| Other values (1182) | 183004 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 548162 | |
| _ | 409852 | |
| . | 409646 | |
| U | 308853 | |
| A | 306694 | |
| S | 286615 | |
| 4 | 178904 | 5.6% |
| 3 | 121201 | 3.8% |
| 7 | 85072 | 2.7% |
| 2 | 75286 | 2.4% |
| Other values (28) | 467233 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1229548 | |
| Decimal Number | 1148472 | |
| Connector Punctuation | 409852 | 12.8% |
| Other Punctuation | 409646 | 12.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 308853 | |
| A | 306694 | |
| S | 286615 | |
| E | 40339 | 3.3% |
| R | 39839 | 3.2% |
| P | 28004 | 2.3% |
| N | 27315 | 2.2% |
| C | 26851 | 2.2% |
| M | 25289 | 2.1% |
| B | 21595 | 1.8% |
| Other values (16) | 118154 | 9.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 548162 | |
| 4 | 178904 | 15.6% |
| 3 | 121201 | 10.6% |
| 7 | 85072 | 7.4% |
| 2 | 75286 | 6.6% |
| 9 | 49176 | 4.3% |
| 5 | 33537 | 2.9% |
| 8 | 25139 | 2.2% |
| 6 | 17641 | 1.5% |
| 0 | 14354 | 1.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 409852 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 409646 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1967970 | |
| Latin | 1229548 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 308853 | |
| A | 306694 | |
| S | 286615 | |
| E | 40339 | 3.3% |
| R | 39839 | 3.2% |
| P | 28004 | 2.3% |
| N | 27315 | 2.2% |
| C | 26851 | 2.2% |
| M | 25289 | 2.1% |
| B | 21595 | 1.8% |
| Other values (16) | 118154 | 9.6% |
Common
| Value | Count | Frequency (%) |
| 1 | 548162 | |
| _ | 409852 | |
| . | 409646 | |
| 4 | 178904 | 9.1% |
| 3 | 121201 | 6.2% |
| 7 | 85072 | 4.3% |
| 2 | 75286 | 3.8% |
| 9 | 49176 | 2.5% |
| 5 | 33537 | 1.7% |
| 8 | 25139 | 1.3% |
| Other values (2) | 31995 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3197518 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 548162 | |
| _ | 409852 | |
| . | 409646 | |
| U | 308853 | |
| A | 306694 | |
| S | 286615 | |
| 4 | 178904 | 5.6% |
| 3 | 121201 | 3.8% |
| 7 | 85072 | 2.7% |
| 2 | 75286 | 2.4% |
| Other values (28) | 467233 |
level1Name
Text
Missing 
| Distinct | 1147 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 174349 |
| Missing (%) | 29.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 9.58385466 |
| Min length | 3 |
Unique
| Unique | 118 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Central |
|---|---|
| 2nd row | North Carolina |
| 3rd row | Saint George |
| 4th row | Virginia |
| 5th row | Nevada |
| Value | Count | Frequency (%) |
| virginia | 85573 | 15.6% |
| carolina | 55602 | 10.1% |
| north | 51399 | 9.3% |
| maryland | 30907 | 5.6% |
| pennsylvania | 18483 | 3.4% |
| west | 17320 | 3.1% |
| tennessee | 10691 | 1.9% |
| california | 9457 | 1.7% |
| georgia | 8403 | 1.5% |
| de | 7912 | 1.4% |
| Other values (1285) | 254513 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 585330 | |
| i | 477589 | |
| n | 369556 | 9.4% |
| r | 331611 | 8.4% |
| o | 265471 | 6.8% |
| l | 174911 | 4.5% |
| e | 169093 | 4.3% |
| s | 148236 | 3.8% |
| (space) | 140408 | 3.6% |
| t | 129070 | 3.3% |
| Other values (89) | 1136687 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3241955 | |
| Uppercase Letter | 539254 | 13.7% |
| Space Separator | 140408 | 3.6% |
| Dash Punctuation | 4354 | 0.1% |
| Other Punctuation | 1959 | < 0.1% |
| Modifier Symbol | 30 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 585330 | |
| i | 477589 | |
| n | 369556 | |
| r | 331611 | |
| o | 265471 | |
| l | 174911 | 5.4% |
| e | 169093 | 5.2% |
| s | 148236 | 4.6% |
| t | 129070 | 4.0% |
| g | 116837 | 3.6% |
| Other values (49) | 474251 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 87515 | |
| C | 81820 | |
| N | 67344 | |
| M | 54002 | |
| P | 35817 | 6.6% |
| S | 25857 | 4.8% |
| T | 24573 | 4.6% |
| A | 22550 | 4.2% |
| W | 21290 | 3.9% |
| G | 18587 | 3.4% |
| Other values (20) | 99899 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1416 | |
| / | 451 | 23.0% |
| ! | 44 | 2.2% |
| . | 38 | 1.9% |
| , | 10 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 140408 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4354 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 30 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3781209 | |
| Common | 146753 | 3.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 585330 | |
| i | 477589 | |
| n | 369556 | 9.8% |
| r | 331611 | 8.8% |
| o | 265471 | 7.0% |
| l | 174911 | 4.6% |
| e | 169093 | 4.5% |
| s | 148236 | 3.9% |
| t | 129070 | 3.4% |
| g | 116837 | 3.1% |
| Other values (79) | 1013505 |
Common
| Value | Count | Frequency (%) |
| (space) | 140408 | |
| - | 4354 | 3.0% |
| ' | 1416 | 1.0% |
| / | 451 | 0.3% |
| ! | 44 | < 0.1% |
| . | 38 | < 0.1% |
| ` | 30 | < 0.1% |
| , | 10 | < 0.1% |
| [ | 1 | < 0.1% |
| ] | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3904533 | |
| None | 23265 | 0.6% |
| Latin Ext Additional | 164 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 585330 | |
| i | 477589 | |
| n | 369556 | 9.5% |
| r | 331611 | 8.5% |
| o | 265471 | 6.8% |
| l | 174911 | 4.5% |
| e | 169093 | 4.3% |
| s | 148236 | 3.8% |
| (space) | 140408 | 3.6% |
| t | 129070 | 3.3% |
| Other values (52) | 1113258 |
None
| Value | Count | Frequency (%) |
| á | 7994 | |
| é | 4043 | |
| í | 3671 | |
| ã | 3233 | |
| ó | 1380 | 5.9% |
| ô | 910 | 3.9% |
| ú | 598 | 2.6% |
| ñ | 567 | 2.4% |
| ü | 278 | 1.2% |
| ï | 179 | 0.8% |
| Other values (20) | 412 | 1.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ệ | 59 | |
| ả | 39 | |
| ồ | 35 | |
| ẵ | 17 | 10.4% |
| ừ | 5 | 3.0% |
| ế | 5 | 3.0% |
| ị | 4 | 2.4% |
level2Gid
Text
Missing 
| Distinct | 4973 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 186113 |
| Missing (%) | 31.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.59813157 |
| Min length | 8 |
Unique
| Unique | 873 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | PNG.2.3_1 |
|---|---|
| 2nd row | USA.34.11_1 |
| 3rd row | USA.47.9_1 |
| 4th row | USA.29.5_1 |
| 5th row | BRA.19.34_2 |
| Value | Count | Frequency (%) |
| usa.34.87_1 | 9937 | 2.5% |
| usa.47.50_1 | 7933 | 2.0% |
| usa.21.10_1 | 6723 | 1.7% |
| usa.34.56_1 | 6344 | 1.6% |
| usa.34.44_1 | 5697 | 1.4% |
| per.1.4_1 | 4919 | 1.2% |
| usa.21.16_1 | 4431 | 1.1% |
| usa.43.78_1 | 3919 | 1.0% |
| usa.49.42_1 | 3487 | 0.9% |
| usa.47.53_1 | 3397 | 0.9% |
| Other values (4963) | 341301 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 795970 | |
| 1 | 660755 | |
| _ | 398088 | |
| A | 305944 | 7.3% |
| U | 305494 | 7.2% |
| S | 286408 | 6.8% |
| 4 | 250715 | 5.9% |
| 3 | 195227 | 4.6% |
| 2 | 179315 | 4.3% |
| 7 | 136265 | 3.2% |
| Other values (28) | 704808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1830675 | |
| Uppercase Letter | 1194256 | |
| Other Punctuation | 795970 | |
| Connector Punctuation | 398088 | 9.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 305944 | |
| U | 305494 | |
| S | 286408 | |
| E | 40275 | 3.4% |
| R | 38370 | 3.2% |
| C | 26058 | 2.2% |
| N | 25849 | 2.2% |
| P | 23473 | 2.0% |
| B | 20673 | 1.7% |
| M | 19615 | 1.6% |
| Other values (16) | 102097 | 8.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 660755 | |
| 4 | 250715 | 13.7% |
| 3 | 195227 | 10.7% |
| 2 | 179315 | 9.8% |
| 7 | 136265 | 7.4% |
| 5 | 106767 | 5.8% |
| 9 | 82662 | 4.5% |
| 8 | 81137 | 4.4% |
| 6 | 74816 | 4.1% |
| 0 | 63016 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 795970 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 398088 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3024733 | |
| Latin | 1194256 | 28.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 305944 | |
| U | 305494 | |
| S | 286408 | |
| E | 40275 | 3.4% |
| R | 38370 | 3.2% |
| C | 26058 | 2.2% |
| N | 25849 | 2.2% |
| P | 23473 | 2.0% |
| B | 20673 | 1.7% |
| M | 19615 | 1.6% |
| Other values (16) | 102097 | 8.5% |
Common
| Value | Count | Frequency (%) |
| . | 795970 | |
| 1 | 660755 | |
| _ | 398088 | |
| 4 | 250715 | 8.3% |
| 3 | 195227 | 6.5% |
| 2 | 179315 | 5.9% |
| 7 | 136265 | 4.5% |
| 5 | 106767 | 3.5% |
| 9 | 82662 | 2.7% |
| 8 | 81137 | 2.7% |
| Other values (2) | 137832 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4218989 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 795970 | |
| 1 | 660755 | |
| _ | 398088 | |
| A | 305944 | 7.3% |
| U | 305494 | 7.2% |
| S | 286408 | 6.8% |
| 4 | 250715 | 5.9% |
| 3 | 195227 | 4.6% |
| 2 | 179315 | 4.3% |
| 7 | 136265 | 3.2% |
| Other values (28) | 704808 |
level2Name
Text
Missing 
| Distinct | 4138 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 186171 |
| Missing (%) | 31.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 8.217365525 |
| Min length | 2 |
Unique
| Unique | 730 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Kairuku-Hiri |
|---|---|
| 2nd row | Buncombe |
| 3rd row | Augusta |
| 4th row | Elko |
| 5th row | Itatiaia |
| Value | Count | Frequency (%) |
| swain | 9937 | 2.0% |
| giles | 7933 | 1.6% |
| frederick | 7093 | 1.5% |
| macon | 6483 | 1.3% |
| madison | 6402 | 1.3% |
| de | 6373 | 1.3% |
| haywood | 5697 | 1.2% |
| la | 5624 | 1.1% |
| san | 5466 | 1.1% |
| prince | 5405 | 1.1% |
| Other values (4402) | 422665 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 366588 | 11.2% |
| e | 273177 | 8.4% |
| n | 252799 | 7.7% |
| o | 249950 | 7.6% |
| r | 207979 | 6.4% |
| i | 190585 | 5.8% |
| l | 143560 | 4.4% |
| s | 133215 | 4.1% |
| t | 117958 | 3.6% |
| u | 95083 | 2.9% |
| Other values (99) | 1239864 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2676771 | |
| Uppercase Letter | 482454 | 14.8% |
| Space Separator | 91048 | 2.8% |
| Dash Punctuation | 10022 | 0.3% |
| Other Punctuation | 8451 | 0.3% |
| Decimal Number | 1843 | 0.1% |
| Open Punctuation | 130 | < 0.1% |
| Close Punctuation | 21 | < 0.1% |
| Math Symbol | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 366588 | |
| e | 273177 | |
| n | 252799 | 9.4% |
| o | 249950 | 9.3% |
| r | 207979 | 7.8% |
| i | 190585 | 7.1% |
| l | 143560 | 5.4% |
| s | 133215 | 5.0% |
| t | 117958 | 4.4% |
| u | 95083 | 3.6% |
| Other values (47) | 645877 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 53401 | 11.1% |
| S | 50551 | 10.5% |
| M | 47650 | 9.9% |
| P | 36378 | 7.5% |
| G | 29941 | 6.2% |
| A | 29893 | 6.2% |
| B | 27596 | 5.7% |
| L | 23560 | 4.9% |
| R | 22673 | 4.7% |
| H | 20772 | 4.3% |
| Other values (23) | 140039 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 903 | |
| 1 | 416 | |
| 0 | 167 | 9.1% |
| 7 | 139 | 7.5% |
| 3 | 77 | 4.2% |
| 2 | 64 | 3.5% |
| 5 | 26 | 1.4% |
| 6 | 22 | 1.2% |
| 9 | 21 | 1.1% |
| 4 | 8 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 6982 | |
| . | 652 | 7.7% |
| / | 455 | 5.4% |
| , | 362 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 91048 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10022 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 130 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3159225 | |
| Common | 111533 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 366588 | 11.6% |
| e | 273177 | 8.6% |
| n | 252799 | 8.0% |
| o | 249950 | 7.9% |
| r | 207979 | 6.6% |
| i | 190585 | 6.0% |
| l | 143560 | 4.5% |
| s | 133215 | 4.2% |
| t | 117958 | 3.7% |
| u | 95083 | 3.0% |
| Other values (80) | 1128331 |
Common
| Value | Count | Frequency (%) |
| (space) | 91048 | |
| - | 10022 | 9.0% |
| ' | 6982 | 6.3% |
| 8 | 903 | 0.8% |
| . | 652 | 0.6% |
| / | 455 | 0.4% |
| 1 | 416 | 0.4% |
| , | 362 | 0.3% |
| 0 | 167 | 0.1% |
| 7 | 139 | 0.1% |
| Other values (9) | 387 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3243339 | |
| None | 27250 | 0.8% |
| Latin Ext Additional | 159 | < 0.1% |
| IPA Ext | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 366588 | 11.3% |
| e | 273177 | 8.4% |
| n | 252799 | 7.8% |
| o | 249950 | 7.7% |
| r | 207979 | 6.4% |
| i | 190585 | 5.9% |
| l | 143560 | 4.4% |
| s | 133215 | 4.1% |
| t | 117958 | 3.6% |
| u | 95083 | 2.9% |
| Other values (61) | 1212445 |
None
| Value | Count | Frequency (%) |
| ó | 7527 | |
| á | 5558 | |
| é | 5213 | |
| í | 4075 | |
| ñ | 1709 | 6.3% |
| ã | 1117 | 4.1% |
| ú | 495 | 1.8% |
| ô | 244 | 0.9% |
| â | 230 | 0.8% |
| ō | 202 | 0.7% |
| Other values (21) | 880 | 3.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ờ | 59 | |
| ị | 59 | |
| ả | 23 | 14.5% |
| ộ | 15 | 9.4% |
| ồ | 2 | 1.3% |
| ắ | 1 | 0.6% |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 10 |
level3Gid
Text
Missing 
| Distinct | 1518 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 532468 |
| Missing (%) | 91.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 11.70438598 |
| Min length | 11 |
Unique
| Unique | 345 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | ECU.18.4.5_1 |
|---|---|
| 2nd row | BOL.6.5.3_2 |
| 3rd row | PER.18.3.4_1 |
| 4th row | ECU.18.4.2_1 |
| 5th row | PER.1.4.3_1 |
| Value | Count | Frequency (%) |
| per.1.4.3_1 | 3333 | 6.4% |
| per.18.3.4_1 | 1833 | 3.5% |
| per.1.4.1_1 | 1584 | 3.1% |
| per.8.9.1_1 | 1099 | 2.1% |
| per.18.1.1_1 | 862 | 1.7% |
| cri.3.3.4_1 | 850 | 1.6% |
| pan.3.3.1_1 | 816 | 1.6% |
| per.8.11.5_1 | 790 | 1.5% |
| ecu.21.2.7_1 | 708 | 1.4% |
| mdg.6.2.3_1 | 683 | 1.3% |
| Other values (1508) | 39175 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 155199 | |
| 1 | 109811 | |
| _ | 51733 | 8.5% |
| E | 29941 | 4.9% |
| 2 | 26345 | 4.4% |
| 3 | 23876 | 3.9% |
| 4 | 23543 | 3.9% |
| R | 20134 | 3.3% |
| C | 19996 | 3.3% |
| U | 14987 | 2.5% |
| Other values (24) | 129938 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 243380 | |
| Other Punctuation | 155199 | |
| Uppercase Letter | 155191 | |
| Connector Punctuation | 51733 | 8.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 29941 | |
| R | 20134 | |
| C | 19996 | |
| U | 14987 | |
| P | 14622 | |
| M | 9107 | 5.9% |
| I | 8533 | 5.5% |
| T | 6123 | 3.9% |
| H | 6010 | 3.9% |
| A | 5737 | 3.7% |
| Other values (12) | 20001 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 109811 | |
| 2 | 26345 | 10.8% |
| 3 | 23876 | 9.8% |
| 4 | 23543 | 9.7% |
| 8 | 14100 | 5.8% |
| 5 | 13401 | 5.5% |
| 6 | 10712 | 4.4% |
| 7 | 9554 | 3.9% |
| 9 | 6983 | 2.9% |
| 0 | 5055 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 155199 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 51733 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 450312 | |
| Latin | 155191 | 25.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 29941 | |
| R | 20134 | |
| C | 19996 | |
| U | 14987 | |
| P | 14622 | |
| M | 9107 | 5.9% |
| I | 8533 | 5.5% |
| T | 6123 | 3.9% |
| H | 6010 | 3.9% |
| A | 5737 | 3.7% |
| Other values (12) | 20001 |
Common
| Value | Count | Frequency (%) |
| . | 155199 | |
| 1 | 109811 | |
| _ | 51733 | 11.5% |
| 2 | 26345 | 5.9% |
| 3 | 23876 | 5.3% |
| 4 | 23543 | 5.2% |
| 8 | 14100 | 3.1% |
| 5 | 13401 | 3.0% |
| 6 | 10712 | 2.4% |
| 7 | 9554 | 2.1% |
| Other values (2) | 12038 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 605503 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 155199 | |
| 1 | 109811 | |
| _ | 51733 | 8.5% |
| E | 29941 | 4.9% |
| 2 | 26345 | 4.4% |
| 3 | 23876 | 3.9% |
| 4 | 23543 | 3.9% |
| R | 20134 | 3.3% |
| C | 19996 | 3.3% |
| U | 14987 | 2.5% |
| Other values (24) | 129938 |
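The level0Gid through level3Gid samples above (for example `PNG`, `USA.34_1`, `USA.34.11_1`, `ECU.18.4.5_1`) appear to follow one pattern: a three-character country code, one dot-separated index per administrative level, and an underscore-separated version suffix on levels 1 and deeper. The sketch below parses that shape. The pattern is inferred from the samples only, and `Gid`, `GID_PATTERN` and `parse_gid` are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple, Optional, Tuple


class Gid(NamedTuple):
    country: str                # three-character code, e.g. "USA"
    indices: Tuple[int, ...]    # one index per administrative level below country
    version: Optional[int]      # suffix after "_", absent on level 0


# Assumed shape, inferred from the samples: CCC[.n[.n[.n]]][_v]
# Digits are allowed in the code because the level0Gid tables report a few.
GID_PATTERN = re.compile(r"^([A-Z0-9]{3})((?:\.\d+)*)(?:_(\d+))?$")


def parse_gid(gid: str) -> Optional[Gid]:
    """Split an identifier into its parts; return None if it does not match."""
    match = GID_PATTERN.match(gid)
    if match is None:
        return None
    country, dotted, version = match.groups()
    indices = tuple(int(part) for part in dotted.split(".") if part)
    return Gid(country, indices, int(version) if version else None)


for sample in ["PNG", "USA.34_1", "USA.34.11_1", "ECU.18.4.5_1"]:
    parsed = parse_gid(sample)
    # The administrative level is the number of dot-separated indices.
    print(sample, parsed, len(parsed.indices))
```

Returning `None` rather than raising keeps the helper usable on columns that may contain values outside the assumed pattern. The version suffix is kept separate because the samples show it can differ between identifiers (`BRA.19.34_2`).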
level3Name
Text
Missing 
| Distinct | 1463 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 532843 |
| Missing (%) | 91.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 10.63014525 |
| Min length | 3 |
Unique
| Unique | 325 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Montalvo (Andoas) |
|---|---|
| 2nd row | Cobija |
| 3rd row | Tambopata |
| 4th row | Diez De Agosto |
| 5th row | Rio Santiago |
| Value | Count | Frequency (%) |
| de | 3839 | 4.4% |
| rio | 3728 | 4.3% |
| santiago | 3466 | 4.0% |
| el | 3141 | 3.6% |
| san | 1843 | 2.1% |
| tambopata | 1833 | 2.1% |
| cenepa | 1584 | 1.8% |
| santa | 1305 | 1.5% |
| cab | 1203 | 1.4% |
| en | 1101 | 1.3% |
| Other values (1721) | 64179 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 83102 | |
| o | 46946 | 8.6% |
| (space) | 35864 | 6.6% |
| n | 35831 | 6.6% |
| i | 30929 | 5.7% |
| e | 29861 | 5.5% |
| r | 26273 | 4.8% |
| t | 20664 | 3.8% |
| l | 19562 | 3.6% |
| u | 17432 | 3.2% |
| Other values (86) | 199479 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 410361 | |
| Uppercase Letter | 85504 | 15.7% |
| Space Separator | 35864 | 6.6% |
| Open Punctuation | 3997 | 0.7% |
| Close Punctuation | 3090 | 0.6% |
| Other Punctuation | 2866 | 0.5% |
| Decimal Number | 2401 | 0.4% |
| Dash Punctuation | 1860 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 83102 | |
| o | 46946 | |
| n | 35831 | |
| i | 30929 | 7.5% |
| e | 29861 | 7.3% |
| r | 26273 | 6.4% |
| t | 20664 | 5.0% |
| l | 19562 | 4.8% |
| u | 17432 | 4.2% |
| s | 13867 | 3.4% |
| Other values (40) | 85894 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 11425 | |
| C | 8284 | 9.7% |
| T | 7232 | 8.5% |
| P | 6443 | 7.5% |
| R | 5549 | 6.5% |
| E | 5542 | 6.5% |
| A | 5386 | 6.3% |
| D | 5367 | 6.3% |
| M | 4809 | 5.6% |
| B | 4384 | 5.1% |
| Other values (18) | 21083 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 666 | |
| 3 | 451 | |
| 0 | 328 | |
| 2 | 261 | 10.9% |
| 4 | 212 | 8.8% |
| 9 | 172 | 7.2% |
| 6 | 142 | 5.9% |
| 5 | 127 | 5.3% |
| 7 | 24 | 1.0% |
| 8 | 18 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2573 | |
| ' | 182 | 6.4% |
| , | 81 | 2.8% |
| / | 30 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 35864 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3997 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3090 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1860 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 495865 | |
| Common | 50078 | 9.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 83102 | |
| o | 46946 | 9.5% |
| n | 35831 | 7.2% |
| i | 30929 | 6.2% |
| e | 29861 | 6.0% |
| r | 26273 | 5.3% |
| t | 20664 | 4.2% |
| l | 19562 | 3.9% |
| u | 17432 | 3.5% |
| s | 13867 | 2.8% |
| Other values (68) | 171398 |
Common
| Value | Count | Frequency (%) |
| (space) | 35864 | |
| ( | 3997 | 8.0% |
| ) | 3090 | 6.2% |
| . | 2573 | 5.1% |
| - | 1860 | 3.7% |
| 1 | 666 | 1.3% |
| 3 | 451 | 0.9% |
| 0 | 328 | 0.7% |
| 2 | 261 | 0.5% |
| 4 | 212 | 0.4% |
| Other values (8) | 776 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 542171 | |
| None | 3721 | 0.7% |
| Latin Ext Additional | 51 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 83102 | |
| o | 46946 | 8.7% |
| (space) | 35864 | 6.6% |
| n | 35831 | 6.6% |
| i | 30929 | 5.7% |
| e | 29861 | 5.5% |
| r | 26273 | 4.8% |
| t | 20664 | 3.8% |
| l | 19562 | 3.6% |
| u | 17432 | 3.2% |
| Other values (60) | 195707 |
None
| Value | Count | Frequency (%) |
| ñ | 1397 | |
| é | 875 | |
| à | 366 | 9.8% |
| á | 232 | 6.2% |
| í | 150 | 4.0% |
| ï | 133 | 3.6% |
| ã | 95 | 2.6% |
| â | 91 | 2.4% |
| ó | 89 | 2.4% |
| è | 71 | 1.9% |
| Other values (11) | 222 | 6.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ờ | 30 | |
| ậ | 12 | 23.5% |
| ọ | 5 | 9.8% |
| ệ | 2 | 3.9% |
| ớ | 2 | 3.9% |
iucnRedListCategory
Text
Missing
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23468 |
| Missing (%) | 4.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | LC |
| 3rd row | LC |
| 4th row | LC |
| 5th row | LC |
| Value | Count | Frequency (%) |
| lc | 459843 | |
| ne | 34497 | 6.2% |
| nt | 23131 | 4.1% |
| vu | 21629 | 3.9% |
| en | 10407 | 1.9% |
| cr | 6915 | 1.2% |
| dd | 4133 | 0.7% |
| ex | 177 | < 0.1% |
| ew | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 466758 | |
| L | 459843 | |
| N | 68035 | 6.1% |
| E | 45082 | 4.0% |
| T | 23131 | 2.1% |
| V | 21629 | 1.9% |
| U | 21629 | 1.9% |
| D | 8266 | 0.7% |
| R | 6915 | 0.6% |
| X | 177 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1121466 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 466758 | |
| L | 459843 | |
| N | 68035 | 6.1% |
| E | 45082 | 4.0% |
| T | 23131 | 2.1% |
| V | 21629 | 1.9% |
| U | 21629 | 1.9% |
| D | 8266 | 0.7% |
| R | 6915 | 0.6% |
| X | 177 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1121466 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 466758 | |
| L | 459843 | |
| N | 68035 | 6.1% |
| E | 45082 | 4.0% |
| T | 23131 | 2.1% |
| V | 21629 | 1.9% |
| U | 21629 | 1.9% |
| D | 8266 | 0.7% |
| R | 6915 | 0.6% |
| X | 177 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1121466 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 466758 | |
| L | 459843 | |
| N | 68035 | 6.1% |
| E | 45082 | 4.0% |
| T | 23131 | 2.1% |
| V | 21629 | 1.9% |
| U | 21629 | 1.9% |
| D | 8266 | 0.7% |
| R | 6915 | 0.6% |
| X | 177 | < 0.1% |
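The two-letter codes in the value table above match the standard IUCN Red List category abbreviations. As a rough sketch of how the counts could be summarized, the snippet below expands the codes to their IUCN names and totals the three threatened categories (VU, EN, CR). The counts are copied from the table; the variable names are illustrative, and percentages are taken over non-missing values only, which is an assumption about how the report computes its frequency column.

```python
# Counts copied from the value table above (codes upper-cased as in the samples).
counts = {
    "LC": 459843, "NE": 34497, "NT": 23131, "VU": 21629, "EN": 10407,
    "CR": 6915, "DD": 4133, "EX": 177, "EW": 1,
}

# Standard IUCN Red List category names for each code.
IUCN_LABELS = {
    "LC": "Least Concern",
    "NE": "Not Evaluated",
    "NT": "Near Threatened",
    "VU": "Vulnerable",
    "EN": "Endangered",
    "CR": "Critically Endangered",
    "DD": "Data Deficient",
    "EX": "Extinct",
    "EW": "Extinct in the Wild",
}

# IUCN groups these three categories as "threatened".
THREATENED = {"VU", "EN", "CR"}

total = sum(counts.values())
threatened = sum(n for code, n in counts.items() if code in THREATENED)

for code, n in sorted(counts.items(), key=lambda kv: -kv[1]):
    print(f"{code}  {IUCN_LABELS[code]:<22} {n:>7}  {n / total:6.1%}")
print(f"threatened: {threatened} of {total} ({threatened / total:.1%})")
```

The total of the nine counts equals the number of observations minus the 23468 missing values, which is a quick consistency check on the table.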